In these lectures I propose to push Einstein's principle of coordinate
independence to the extreme in order to restrict the possible form of fundamental
equations of motion in physics. I start from nearly tautological system theoretic
axioms. They provide a minimal amount of a priori structure which is
thought to be characteristic of human thinking in general. It is shown how
formal discretizations of Maxwell and Yang Mills theory in flat space and of
general relativity in Ashtekar variables fit into this framework.

1 What distinguishes truly fundamental physical theories?

The purpose of science is complexity reduction. We wish to understand a
multitude of emergent phenomena starting from few basic principles. This tells us
what criterion could be used to distinguish between more and less fundamental
physical theories. A theory will be the more fundamental the less structure is
assumed a priori. This is plausible, because what is assumed is not explained.
Among the a priori structure will be all the axioms of the mathematical
theories that are used to formulate the theory. Typically such axioms are relations
between mathematical objects, and there is no a priori reason why the relations
postulated in some arbitrary mathematical theory should find their
correspondence in nature.

What would be the minimum a priori structure which we need to assume 
in order to build on it a theory of the world? Certainly the structure of human 
thinking will need to be included, because we cannot avoid using it in building 
our theories. Could such an assumption be enough by itself? We do not have 
the power to prove that it is, but it is very interesting to examine the question 
of how far one can go, and it brings us into contact with the whole history of 
human thought. 

Philosophers might object that "structure of human thinking" must mean
logic, and Immanuel Kant proved in his famous "Critique of Pure Reason"
that a theory which describes observed phenomena in the world cannot be
deduced from logic alone.

*Cargese lectures 1996, to appear in: G. 't Hooft et al. (eds.), Quantum Fields and Quantum
Space Time, Plenum Press 1997. Work supported in part by the German Israel foundation

However, studies of linguists [?] (and everyday experience) reveal that the
structure of human thinking is not adequately represented by logic.

Moreover, Kant assumed too much structure a priori, including his version
of the Aristotelian categories,^1 and this leads to too many possibilities of what
can be thought of. Einstein's general relativity principle amounts to postulating
the absence of a priori structure, as we will see. It is well known that this
relativity principle restricts the form of the possible equations of motion very
much. Here I try to push this principle to the extreme.

Last but not least, the theory of complex systems has an entirely new
selection principle, inherited from quantum field theory and unknown to Kant, by
which to select from all the theories which can be thought of those which give
rise to emergent phenomena that could in principle be observed. I will come
back to this later.

One more objection must be answered. It might be argued that assumptions
on the structure of human thinking are inappropriate as a priori assumptions;
they should be deduced from neurophysiological data [16]. But "deduced" is the
wrong word. According to iron rules of logic, one is not allowed to use in a
deduction that which is to be deduced. Building a theory of mental activity based
on neurophysiological data makes use of the structure of human thinking. One
is trying to construct a self-consistent picture, and this can certainly produce
very important insights, but it is not a deduction, and one cannot transcend to
a level more fundamental than the structure of human thinking.

I will first state my basic assumption about the structure of human thinking 
informally. It will be made precise in the next section. 

Pre-Axiom: The human mind thinks about relations between things or agents.


Relations will be regarded as directed. The traditional notation in logic is
aRb for a relation R of a to b. I prefer to use the notation which is now customary
in mathematics (category theory), where one denotes objects (things, agents)
by capital letters X, Y, ... and arrows (relations) by small letters f, g, ..., and
where $f : X \mapsto Y$ stands for a relation from X to Y.

It will be assumed as a defining property of relations^3 that they can be
composed. If there is a relation $f : X \mapsto Y$ from X to Y and a relation
$g : Y \mapsto Z$ from Y to Z, then this defines a relation, denoted $g \circ f : X \mapsto Z$,
from X to Z. Think of a friend of a friend, or of a brother-in-law who is the
husband of a sister. There can be relations from X to X; among them is the
identity $\iota_X$ of X with itself.

^1 In Hofstadter's book "Gödel, Escher, Bach" an intriguing interpretation of the notion of
enlightenment in Zen-Buddhism is proposed. Briefly it amounts to transcending below the
level of Aristotelian categories to a more fundamental level of mental activity - thinking free of
(Aristotelian!) categories. In a way we are attempting something like that when we start from
general systems and name things which would belong to different categories. The Aristotelian
categories are not a priori here, in contrast to Kant. In particular, the properties of space are
not a priori.

^2 There exists now a data base which categorizes and lists thousands of things the human
mind thinks of, and the relations between them. It was produced by Cycorp corporation
with a view to commercial applications; see the entries "The Cyc Technology" and "The
Upper Cyc Ontology" under http://www.cyc.com, especially http://www.cyc.com/cyc-2-1/toc.html
(status Sept 96).

^3 It is said that string theory tries to construct geometry from extended objects. But what

Typically, a relation $f : X \mapsto Y$ from X to Y specifies a relation in the opposite
direction, denoted $f^* : Y \mapsto X$. If X is the wife of Y then Y is the husband of
X.

The objects of a system can themselves be systems, i.e. have internal
structure. In this way, a general framework for the discussion of self organization in
complex systems is obtained.

A generalized notion of locality will be built into the axioms. We know 
since the discovery of Faraday's Nahewirkungsprinzip in the last century that
fundamental physical laws relate only physical quantities at infinitesimally close 
points of space time. In discretized theories, the notion of infinitesimally close 
is replaced by nearest neighbor relations; this specifies a graph (e.g. a lattice) 
and singles out certain relations as fundamental. All other relations can be 
composed from fundamental ones. The fundamental relations will be called 
links. 

This notion of locality leads to a definition of the notion of emergence which 
is a key concept of complex systems theory. Emergence is the appearance of 
nonlocal phenomena as a consequence of local laws. Propagation of
electromagnetic waves is an example, and also the reproduction fork dynamics which
models the replication of DNA in cells (see ref. [?] and figure [?] below).

These basic assumptions will be subsumed in the axiomatic definition of a
system; mathematically it is both a category and a graph. The relations are the
arrows of the category, and there is a *-operation on arrows.

The objects are actually of secondary importance. They can be recon- 
structed when one knows which arrows can be composed, and what is the result 
of the composition. 

Classically, the state of the world at one time (and also the world sub specie 
aeternitatis) is assumed to be described by a system of this kind. Quantum 
mechanically, there is a wave function which assigns a complex amplitude to 
systems. 

There are no numbers in this to begin with, and no arithmetic operations. 
One cannot make mistakes of $2\pi$ in the fundamental equations because there
is no $\pi$. The only substitute for arithmetic operations is the composition $\circ$ of
relations. As a result, truly fundamental physical laws - those that can be stated 
in this language - cannot contain any (dimension-less) free constants. Also one 
cannot "add" physical theories (e.g. Einstein + Maxwell) in a familiar way. 

It will be seen later how one comes to correspondences with quantitative 
theories in the first place. 

means "extended" before there is space? It can only mean a property which is going to be 
interpreted as "being extended" after space has been constructed. In the present frame work, 
being a relation is such a property. 
Coordinates are numerical encodings of positions in some space. The absence
of a priori numerical structure is a way to push coordinate independence to the
extreme.

How does one build a theory of the world on so little a priori structure? It 
proceeds in two steps 

1. Name things, 

2. Make statements about named things. 

In this paper I will be chiefly concerned with fundamental physics. The "things" 

which will be named and examined will be 

electro-magnetic fields and Yang Mills fields, 

space (in the sense of space-like hyper-surface of space time), 

matter (Dirac fields). 

To explain the naming step, it is necessary to distinguish between two dif- 
ferent types of physical laws in the traditional formulation. 

First, there are laws which constrain the state of a part of the world at one 
time. Gauss' law in electrodynamics is a most important example of such a law. 
In a canonical formalism these laws are called constraints. All the fundamental 
physical theories, including general relativity, are gauge theories, and they all 
obey nontrivial constraints. There are further properties which can be read 
off the state at one time, and which are preserved in time. I will count them 
among the constraints. It will be seen that our a priori structural assumptions, 
as poor as they are, provide for a gauge group (or a substitute for it) which 
can be read off the initial state, and for a notion of gauge invariants which 
determines what could be observable in a particular kind of system. The named 
things in the above list will be systems which are distinguished by the validity of 
constraints which are characteristic for them. The statement of the constraints 
must be meaningful, given only the a priori structure which is furnished by the 
axiomatic properties of a system. 

One may ask the philosophical question whether the constraints are really 
physical laws, or just denominations. This brings us back to the discussion 
above. The principle of emergence may single out some of the possible con- 
straints as physical laws because such properties of systems are the only ones 
which can be observed at a macroscopic level. 

Secondly there are laws which govern the dynamics (time development) of 
a system. I will seek dynamical laws which are universal in the sense that they 
can be stated in a meaningful way for any system whatever. This is a very 
restrictive requirement on a truly fundamental dynamical law, because there 
is so little a priori structure which can be used to write down an equation of 
motion. 

In this paper I concentrate on the conceptual issues. The precise form of the 
equations is still open to experimentation. There is an essentially unique first 
order equation of motion, cp. figure [?]. To accommodate second order dynamics,
I admit two different kinds of links - essentially coordinates and momenta (or 
velocities). They are represented by thin and fat lines in the figures. But this 
weakens uniqueness; unfortunately there are now several equations which can 
be written down. Universal forms of the equations of motion of Maxwell, Yang
Mills and of Einstein are shown in figures 3 and 5. One sees that they fit on the
template, figure 2, but they are not exactly the same. The constraints, figures
4 and 6, are also not the same, but this is as it should be.

Figure 1: triangular lattice

It remains to be seen whether the two kinds of links can be fused into a single 
one^4 which satisfies one single universal equation of motion. The universal law is
supposed to specialize to the known fundamental dynamical laws when applied 
to states which satisfy the appropriate constraints. 

There is no rigorous classification yet, and the investigations up to now are at
the level of formal discretizations of known continuum theories. The indications
from the available evidence are that the following theories admit a universal
formulation in the system theoretic framework:

1. General relativity with or without massless Dirac matter fields 

2. Yang Mills theory in flat space with or without massless Dirac matter
fields 

But Einstein Maxwell theory (or Yang Mills theory in curved space) does not
appear to fit; it is not unified enough. Also a cosmological constant, fundamental
masses or fundamental Higgs fields do not fit. The problem comes from explicit
factors $\sqrt{-g}$ and $g^{\mu\nu}$ which cannot be absorbed. I will describe below a
universal formulation of Maxwell and Yang Mills equations, of the Einstein
equations, and of the massless Dirac equation.

Discretizations of super-symmetric theories have not been investigated yet. 
They ought to be investigated because they may offer the best chance of leading 
to emergent phenomena by virtue of cancelations of divergences at short distance 
which one encounters when one tries to enforce long range effects of short range 
interactions. 

1.1 Einstein's principles 

Let me pause to discuss how Einstein's principles fit in with the philosophy. 

The two underlying principles of Einstein's General Relativity are the
principle of relativity, or general covariance, and the equivalence principle. When
appropriately interpreted these principles are also operative in the gauge
theories of elementary particle physics (modulo troubles with the Higgs sector) [?].
It is well known how these principles constrain equations of motion.



^4 In the Poincaré gauge theory approach to general relativity one tries to achieve such a
fusion by interpreting vierbeins as vector potentials of the translation group [?].



Figure 2: The universal equation of motion of fundamental physics. The symbol 
symbolizes the effect of one time step. There is a product over all triangles 
which share the link b* (= b with opposite orientation). A variant of the equation 
exists which has the orientation of the triangles reversed. The gauge covariant 
massless Dirac equation is a special case; it governs the evolution of links b to
or from $\infty$.




Figure 3: Maxwell Equations of Electrodynamics. The Yang Mills equations of 
general gauge field theories have the same form. It involves a product over all 
triangles which share the horizontal link. In the presence of Dirac matter, the 
triangle can have a tip at $\infty$.




Figure 4: Gauss' law for Electrodynamics, Yang Mills theory and General
Relativity. In the presence of Dirac matter, one of the points is at $\infty$.
Figure 5: Equations of motion of general relativity. The product is over all
triangles which share the horizontal link.





Figure 6: vector and scalar constraint of general relativity. There is a product 
over triangles which share the link i and the corner x, respectively. In 2 + 1 
dimensions there is a simpler version, cp. figure 7.




Figure 7: Simplified form of vector + scalar constraint for 2 + 1-dimensional 
gravity. The equality must hold true for every thin-lined triangle 
The principle of relativity is a statement of absence of a priori structure.
Before general relativity it was thought that space has an a priori structure 
which defines the notion of a straight line. This is equivalent to an a priori 
defined possibility of comparing directions at different points in space. This 
a priori structure is abandoned in general relativity and in gauge theory. To 
compare tangent vectors (or vectors in color space) at different points of space 
time one must use parallel transport of vectors from one point to the other, 
and the result depends both on the path along which one transports, and on a 
connection ($Sl(2, \mathbf{C})$-gauge field in relativity) which is dynamically determined
as a solution of equations of motion. Gauge covariance follows. 

In the traditional formulation of general relativity, the principle of relativity 
is not pushed to its logical conclusion, though. The assumption of an a priori 
structure of space time as a differentiable manifold means that one assumes an 
a priori definition of straight line in the infinitesimally small. It has been sus- 
pected for a long time that this is an unreasonable assumption when it comes to 
physics at the Planck scale. In the traditional formulation, general covariance 
demands that there should be no preferred coordinate system. But one assumes 
an a priori defined preferred class of coordinate systems, viz. smooth coordi- 
nates. General covariance is then interpreted to mean that the fundamental 
equations of the theory retain their form under transformations of coordinate 
systems within the preferred class. If one pushes the principle of coordinate 
independence to its logical conclusion, the fundamental equations should make 
sense without any reference to coordinates whatever. 

The principle of equivalence asserts that the motion of material bodies is free 
in a local Lorentz frame. The notion of free motion makes essential reference 
to an a priori defined notion of straight line in the infinitesimally small. But 
when matter is described quantum mechanically, the notion of "straight ahead" 
in the infinitesimally small is no longer needed. Newton's law gets replaced 
by a Schrödinger equation which involves a gauge invariant Laplace or Dirac
operator. To define it one needs only the appropriate parallel transporters, 
plus linearity which is supplied by the principles of quantum mechanics. In 
conclusion it is reasonable to hope that a sufficiently strong principle of coordi- 
nate independence alone should be sufficient to single out the truly fundamental 
dynamical laws in physics. 

Along the way, principal fiber bundles will go away. Mathematical physicists
tend to think that principal fiber bundles are the essence of gauge theory,
but it is not so. The definition of a global multiplication from the right with
elements of the structure group is an a priori global structure. It amounts to
postulating certain invariants. It is contrary to the spirit of gauge theory, which
emphasizes locality. The principal fiber bundle structure provides for an a priori
tensor product of representations which commutes with parallel transport. But
different tensor products are used in gauge theories with quantum gauge groups;
they are used in some models of quantum space time [?].
2 System theoretic foundations



Motivated by the pre-axiom of the previous section I will now give a formal
definition of a system. Following the terminology in category theory, the agents
of a system will be called objects and the directed relations between them are
called arrows.

A complete system consists of objects X and arrows $f : X \mapsto Y$. X is called
the source and Y the target of the arrow f. Some of the arrows are declared
fundamental and are called links. They are typically denoted by b. The following
conditions are imposed.

1. Arrows can be composed. With $f : X \mapsto Y$ and $g : Y \mapsto Z$, the arrow
$g \circ f : X \mapsto Z$ is defined. Composition $\circ$ is associative.

2. To every object X there exists a fundamental arrow $\iota_X : X \mapsto X$, called
the identity arrow; $\iota_Y \circ f = f = f \circ \iota_X$ for every arrow $f : X \mapsto Y$.

3. All arrows can be obtained by composing fundamental ones:

$f = b_n \circ \dots \circ b_2 \circ b_1.$ (1)

4. To every arrow $f : X \mapsto Y$ there is an adjoint arrow $f^* : Y \mapsto X$ such
that $f^{**} = f$ and $(g \circ f)^* = f^* \circ g^*$, $\iota_X^* = \iota_X$.

5. The graph whose vertices are the objects and whose links are the fundamental
arrows is connected. Equivalently: For every pair X, Y there is an
arrow $X \mapsto Y$.

Mathematically, a complete system is both a category and a directed graph. The 
first two axioms are those of a category, the fourth one asserts the existence of 
adjoints of arrows. To specify the graph, certain arrows are singled out as links. 
The objects are the vertices of the graph and the fundamental arrows are the 
links of the graph. To every link, there is a link in the opposite direction, but 
this requirement will be abandoned. 
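
The graph side of these axioms can be made concrete with a small sketch. It is only an illustration under stated assumptions: links are stored as (source, target) pairs, and the function names `adjoint_closed` and `is_connected` are hypothetical, not taken from the lectures. The first function checks that every link has a partner in the opposite direction, the second checks axiom 5 by a breadth-first search along links.

```python
from collections import deque

def adjoint_closed(links):
    """True if every link (X, Y) has a partner (Y, X) in the opposite direction."""
    link_set = set(links)
    return all((y, x) in link_set for (x, y) in link_set)

def is_connected(objects, links):
    """Breadth-first search along links, starting from an arbitrary object."""
    objects = list(objects)
    if not objects:
        return True
    neighbours = {x: [] for x in objects}
    for x, y in links:
        neighbours[x].append(y)
    seen = {objects[0]}
    queue = deque([objects[0]])
    while queue:
        x = queue.popleft()
        for y in neighbours[x]:
            if y not in seen:
                seen.add(y)
                queue.append(y)
    return len(seen) == len(objects)

# A triangle with links in both directions (illustrative data).
objects = ["X", "Y", "Z"]
links = [("X", "Y"), ("Y", "X"), ("Y", "Z"), ("Z", "Y"), ("Z", "X"), ("X", "Z")]
```

When the links are closed under adjoints, reaching every object from one starting object already implies that there is a path, and hence an arrow, between every pair of objects.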

In a system, the 4-th axiom is relaxed. I admit that this is motivated by 
hindsight. It is necessary to accommodate certain dynamical processes which 
are very important in biology, such as DNA-replication [?]. The adjoints of
some of the fundamental arrows b are allowed to be absent. Thus, a system 
can be thought as being obtained from a complete system by declaring certain 
links as absent, and with them all arrows which can no longer be composed 
from fundamental arrows. However, the absent adjoints can be added again in 
a unique fashion. 

We write $b^* = \emptyset$ if b has no adjoint. There are now two possibilities:

$b^{**} = b$ or $b^{**} = \emptyset$. (2)

Given a category, one needs to single out links to get a graph and a system. 
Let us examine the converse question: Given a directed graph, to what extent 
does it specify the system? 
We can define a path from $X = X_0$ to $Y = X_n$ to consist of a sequence of
links $b_0, b_1, \dots, b_n$ such that the target of $b_i : X_i \mapsto X_{i+1}$ is also the source of
$b_{i+1}$. Paths can be composed by juxtaposition. Adding the identity links, we
obtain in this way a category and thereby a system $S(G)$ in a canonical way
from a graph G. Therefore the assumption that relations can be composed is
in fact a tautological one. It is useful because it institutionalizes the possibility
that different paths may represent indistinguishable relations, as follows.
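
The canonical construction of a category of paths from a graph can be sketched in a few lines of Python. This is an illustration only, not part of the lectures: the names `Link`, `Path`, `identity`, `from_link` and `compose` are hypothetical, and a path is stored as a tuple of links in the order in which they are traversed. No relations between paths are imposed, so two paths are equal only if they consist of the same links.

```python
from dataclasses import dataclass
from typing import Tuple

@dataclass(frozen=True)
class Link:
    name: str
    source: str
    target: str

@dataclass(frozen=True)
class Path:
    source: str
    target: str
    links: Tuple[Link, ...] = ()

def identity(x: str) -> Path:
    """Empty path at x; it plays the role of the identity arrow of x."""
    return Path(x, x, ())

def from_link(b: Link) -> Path:
    """Regard a single link as a path of length one."""
    return Path(b.source, b.target, (b,))

def compose(g: Path, f: Path) -> Path:
    """Return g o f: first traverse f, then g (juxtaposition of paths)."""
    if f.target != g.source:
        raise ValueError("paths are not composable")
    return Path(f.source, g.target, f.links + g.links)

# Illustrative chain of links X -> Y -> Z -> W.
b1 = from_link(Link("b1", "X", "Y"))
b2 = from_link(Link("b2", "Y", "Z"))
b3 = from_link(Link("b3", "Z", "W"))
```

Associativity and the identity laws hold automatically because tuple concatenation is associative and the empty tuple is neutral, which is one way to read the remark that composability is a tautological assumption.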

Suppose we start from a system S, and we reconstruct from its graph G the
system $S(G)$. It need not be equal to S because different paths may define
the same arrow in S. The arrows in S are in general equivalence classes of
paths. Therefore, given a graph G, a system can be defined by specifying a
generating set of relations between links. In the spirit of our locality principle,
local relations are particularly important.

Two most important examples of such relations are as follows:

$b^* \circ b = \iota_X, \quad b \circ b^* = \iota_Y,$ (3)
$b_3 \circ b_2 \circ b_1 = \iota_X$ (4)

for all links $b : X \mapsto Y$ and for all triangles (i.e. loops of three links) from X to
X, respectively. Interesting generalizations will be encountered when we come
to the Dirac equation. They differ only in some minus signs.
A further interesting type of relation is

$e \circ b^* = b^{-1} \circ e$ (5)

where $e : X \mapsto X$ are square roots of minus signs. This could be used to characterize
$Sl(2, \mathbf{C})$ connections as they appear in general relativity.

In the system theoretic framework, a minus sign is a collection of links, denoted
$-\iota_X : X \mapsto X$, such that $-\iota_X \neq \iota_X$, but $(-\iota_X) \circ (-\iota_X) = \iota_X$, and
$(-\iota_Y) \circ b = b \circ (-\iota_X)$ for all links $b : X \mapsto Y$.

A system will be said to be unfrustrated if there is at most one arrow from 
X to Y, whatever X, Y. Curvature in general relativity and field strength in 
gauge field theory are instances of frustration. 

We will also need a notion of isomorphism of systems because we will not 
distinguish between isomorphic systems. 

A functor $\mathcal{F}$ is defined as in category theory. It is a map from one system
to another one which preserves identity and composition law. If $f : X \mapsto Y$
then $\mathcal{F}(f) : \mathcal{F}(X) \mapsto \mathcal{F}(Y)$, $\mathcal{F}(\iota_X) = \iota_{\mathcal{F}(X)}$, and
$\mathcal{F}(g \circ f) = \mathcal{F}(g) \circ \mathcal{F}(f)$. It is
not required that $\mathcal{F}$ maps fundamental arrows into fundamental arrows, but it
is postulated that $\mathcal{F}(f^*) = \mathcal{F}(f)^*$.

Such a functor is called an isomorphism of the system if it has an inverse 
functor, and if it maps fundamental arrows into fundamental arrows. 

2.1 The language of thought 

Our assumptions on the structure of human thinking amount to the postulate
that the human mind manipulates objects and relations of systems by operations
which are well defined as a consequence of the axiomatic properties of a system.
If one entertains the notion that thinking uses some sort of language, then
one would be led to calling the system theoretic framework the language of
thought. However, it is different from natural languages and from artificial
languages including formal systems in one crucial aspect. All true languages
have a serial structure. They are modeled on verbal utterances which are one
word after another. General systems have no serial structure.

Two questions arise naturally. How do properties of systems which occur 
in our mind during mental activity get translated into statements of a natural 
language, and where in the brain does the translation take place? 

Related questions were raised by Raichle [?] in his interpretation of recent
neurophysiological experiments which localize types of mental activity in
the brain by a differential measurement of blood flow using PET (or Nuclear 
Magnetic Resonance). 



3 Gauge theory aspects 

I tried to make precise the idea that the human mind thinks about systems 
which consist of things and relations between them. It will presently be seen 
that this encapsulates the essence of gauge theories as we know them in physics,
in spite of the poverty of the assumed a priori structure. 

Let $G_X$ consist of all arrows $g : X \mapsto X$. They are called loops. Because
of the composition law, $G_X$ is a semi-group. It will be called the holonomy
semi-group or local gauge semi-group at X.

A gauge transformation is a map of the system which takes every object X
into itself, and arrows $f : X \mapsto Y$ into new arrows $f' : X \mapsto Y$ such that

$g(Y) f = f' g(X)$ (6)

for all arrows f and a suitable choice of $g(Z) \in G_Z$ for all Z. Such a map is
automatically functorial,^5 i.e.

$(g \circ f)' = g' \circ f'.$

In unfrustrated systems, the gauge semi-groups are trivial, i.e. they consist
only of the identity $\iota_X$.

In our physical application, each arrow $b : X \mapsto Y$ will have an inverse $b^{-1}$
such that $b \circ b^{-1} = \iota_Y$ and $b^{-1} \circ b = \iota_X$. In this case the local gauge semi-group
is actually a group, and it is independent of X modulo isomorphism. It is called
the gauge group. Gauge transformations take the familiar form

$f' = g(Y) f g(X)^{-1}.$
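
When all arrows are invertible, the functorial property of gauge transformations can be checked in one line. The following is a short worked step added for illustration; the second arrow is called h here (an assumption of notation, not the text's) to avoid a clash with the gauge function $g(\cdot)$. For $f : X \mapsto Y$ and $h : Y \mapsto Z$, the factors at the intermediate object Y cancel:

```latex
\begin{aligned}
h' \circ f'
  &= \bigl( g(Z)\, h\, g(Y)^{-1} \bigr) \bigl( g(Y)\, f\, g(X)^{-1} \bigr) \\
  &= g(Z)\, (h \circ f)\, g(X)^{-1} \\
  &= (h \circ f)' .
\end{aligned}
```

The same cancellation shows that a loop at X is transformed by conjugation with $g(X)$, so only conjugation-invariant functions of loops are gauge invariant.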

Let us consider some examples of systems. 

^5 In category theoretical language it is a functor which preserves objects and which admits
a natural transformation to the identity.
Figure 8: The structure of a brick wall



Example 1 (triangulated manifold) Consider a triangulated manifold. The
objects of the system shall be the 0-simplices, and the links the 1-simplices. The
adjoint link is given by the 1-simplex with opposite orientation. Imposing the
two relations (3) and (4), the arrows $f : X \mapsto Y$ of the system will be the homotopy
classes of paths from X to Y. The gauge group is the fundamental group of
the manifold. The system is unfrustrated if and only if the manifold is simply
connected.

Example 2 (brick wall, see figure 8). The objects are the bricks, and the
fundamental arrows are the translations which take one brick to the position of a
nearest neighbor. They can be composed to translations to other bricks' positions.
The system is unfrustrated, and the gauge group is trivial.

Example 3 (logical archetype) The system has two objects, denoted T and
F, and three fundamental arrows other than the identities:

$e : T \mapsto F, \quad e^* : F \mapsto T, \quad o = o^* : F \mapsto F$

subject to the relations

$e \circ e^* = \iota_F, \quad e^* \circ e = \iota_T, \quad o \circ o = o.$

The gauge semi-groups for T and F are isomorphic to the two element semi-group
$\{\iota_F, o\} = \{1, 0\}$ with the usual multiplication law.
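
The identification of the loops at F with the numbers 1 and 0 can be verified by brute force. The sketch below is illustrative only; the string labels `"i_F"` and `"o"`, the dictionary `AS_NUMBER` and the function names are hypothetical. It encodes the composition rules of the two loops and checks that composition corresponds to ordinary multiplication.

```python
from itertools import product

# The two loops at F in the logical archetype: the identity i_F and the arrow o.
LOOPS = ("i_F", "o")

# Hypothetical dictionary sending each loop to a number.
AS_NUMBER = {"i_F": 1, "o": 0}

def compose(g: str, f: str) -> str:
    """Return g o f, using i_F o f = f, o o i_F = o and o o o = o."""
    return f if g == "i_F" else "o"

def is_homomorphism() -> bool:
    """Check that composition of loops corresponds to multiplication in {1, 0}."""
    return all(
        AS_NUMBER[compose(g, f)] == AS_NUMBER[g] * AS_NUMBER[f]
        for g, f in product(LOOPS, repeat=2)
    )
```

Since the map between the two-element sets is a bijection, the homomorphism property is all that is needed for an isomorphism of semi-groups.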

3.1 Representations 

I will introduce a general notion of representation. 

In group theory, a representation is not simply a homomorphism from one 
group to another. It is required that the representation operators are linear op- 
erators in a Hilbert space. As a result, there is an a priori defined multiplication 
for them which is consistent with the linear structure in the Hilbert space in 
the sense that the distributive law holds. 

Similarly, models in model theory [?] are also a kind of representation. They
are structure preserving maps whose images are sets. 

And in everyday life, an oil painting has some a priori structure in addition 
to representing the structure of what is painted - it consists of paint of some 
chemical composition on canvas. 

Motivated by this, representations of a system S will be defined as functorial 
maps into some given system or into an element of a class of systems which 
come equipped with some characteristic additional structure. It is required that
fundamental arrows are taken into fundamental arrows. 

Example (logical representations) Equipped with a binary product $\backslash$,

$T \backslash T = F, \quad T \backslash F = F, \quad F \backslash F = T$ (7)

the logical archetype (example 3 above) appears as the image of representations
$\mathcal{F}$ of systems which come equipped with a product (binary composition) $\backslash$ of
objects. The notion of a product is understood to demand that there are links
$A \leftarrow A \backslash B \rightarrow B$.^6 Objects are interpreted as propositions, links $A \mapsto B$
(other than the identity) are interpreted to mean "A excludes B", and $\backslash$ means
"neither nor". If a link's adjoint is its inverse, it gets interpreted as negation.
A representation assigns a truth value T or F to every object (proposition); the
representation property ensures that the rules of logic are obeyed, provided it is
required that the representation preserves composition,
$\mathcal{F}(A \backslash B) = \mathcal{F}(A) \backslash \mathcal{F}(B)$,
and the special links $b : A \backslash A \mapsto A$ are unitary (i.e. they obey $b^* = b^{-1}$), so that
they will be interpreted as negation.

Such representations may exist or not, and they may be unique or not. One
may consider logical representations of arbitrary systems with a product (binary
composition $\backslash$) but it is natural to require unitarity $b^* = b^{-1}$ of the special links
$b : A \backslash A \mapsto A$. One writes $\neg A = A \backslash A$. If there is a subsystem of the form
$\neg(A \backslash \neg A) \mapsto B$ then $\neg B$ is interpreted as an axiom, because in any representation
$\neg(A \backslash \neg A)$ is true.

A representation may fail to exist because the axioms are contradictory. If 
the representation is not unique, then the truth of some propositions cannot be 
decided from the axioms. 
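
The binary product with the truth table (7) is the logical NOR ("neither nor"), and the claims made about it can be checked mechanically. The following sketch is an illustration; the function names `nor`, `neg`, `disj` and `conj` are hypothetical, and Python's `True` and `False` stand for the objects T and F.

```python
def nor(a: bool, b: bool) -> bool:
    """The product of (7): true only if neither a nor b is true."""
    return not (a or b)

def neg(a: bool) -> bool:
    """Negation written as A\\A."""
    return nor(a, a)

def disj(a: bool, b: bool) -> bool:
    """'a or b' expressed as the negation of 'neither a nor b'."""
    return neg(nor(a, b))

def conj(a: bool, b: bool) -> bool:
    """'a and b' expressed as 'neither not-a nor not-b'."""
    return nor(neg(a), neg(b))

def excluded_middle_holds() -> bool:
    """not(A \\ not A) is true for every truth value of A."""
    return all(neg(nor(a, neg(a))) for a in (True, False))
```

That negation, disjunction and conjunction can all be built from this single product is the usual functional completeness of NOR, which is why one binary composition of objects suffices for propositional logic.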



3.2 Representation of a system as a communication network

Next I will state a representation theorem which will show that in spite of the 
nearly tautological character of our assumptions, all the essential structure of 
lattice gauge theory (on irregular lattices) is encapsulated in it, except for the 
linearity of the charge - or color spaces whose elements are subject to parallel 
transport. The arrows will become maps, but not necessarily linear maps. 

Representation theorem: Every (finite) system admits a faithful representation
as a network as follows: There are spaces $\Omega_X$ associated with objects
X and arrows act as maps $f : \Omega_X \mapsto \Omega_Y$, with $\iota_X = \mathrm{id}$.

The construction of the space $\Omega_X$ uses the sets of all arrows to and from X.
Details are given in Appendix A.

For now let us talk of one time. Then the maps f may be interpreted as
channels of communication. Time development (and acts of communication) is 
only considered later. 

^This generalizes a corresponding construction in category theory 
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The sets need not be linear spaces and the maps / need not be hnear. 
Apart from this, the setup is as in lattice gauge theory.. The objects X may be 
elements of an irregular lattice but irregular lattices were considered before. 

Scholium: (Lattice gauge theory) In the Hamiltonian formulation of
lattice gauge theory, space is a discrete lattice like the one shown in the corresponding figure. In the continuum,
one has vector potentials $A = A_i dx^i$ which take their values in the Lie algebra of
the gauge group. From them, parallel transporters along paths $C$ in space from
$X$ to $Y$ are constructed as path ordered products $u(C) = P\exp(-\int_C A)$. They
map the fiber $\Omega_X$ of a vector bundle at $X$ into the fiber $\Omega_Y$ at $Y$. In lattice
gauge theory, the parallel transporters along the links of the lattice are the basic
variables of the theory. Finite difference versions of covariant derivatives are 
constructed with the help of these parallel transporters. 
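
As a concrete illustration, here is a minimal sketch for the gauge group U(1), with made-up link angles on a single triangle. An explicit factor $i$ is put in the exponent so that the angles are real. It composes link transporters along a closed path and checks that the result does not change under a gauge transformation.

```python
import cmath

# Made-up link angles on a triangle with sites 0, 1, 2 (illustrative values only).
angles = {(0, 1): 0.3, (1, 2): -0.7, (2, 0): 1.1}
# Transporter along the link x -> y.
U = {link: cmath.exp(-1j * a) for link, a in angles.items()}

def transport(U, path):
    """Path-ordered product along consecutive sites; later links act from the left."""
    result = 1 + 0j
    for x, y in zip(path, path[1:]):
        result = U[(x, y)] * result
    return result

def gauge_transform(U, g):
    """Change of basis g[x] in every fiber: U(x -> y) becomes g[y] U g[x]^(-1)."""
    return {(x, y): g[y] * u / g[x] for (x, y), u in U.items()}

g = {0: cmath.exp(0.4j), 1: cmath.exp(-1.3j), 2: cmath.exp(2.2j)}
loop = transport(U, [0, 1, 2, 0])
loop_transformed = transport(gauge_transform(U, g), [0, 1, 2, 0])
```

For a noncommutative gauge group the loop would only be invariant up to conjugation at its base point; the sketch covers the Abelian case.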

Values of matter fields $\Psi(X)$ could be interpreted as elements of $\Omega_X$, but we
will prefer to regard them as maps (i.e. links) from some "flavor space" $\Omega_\infty$ to
$\Omega_X$ in the later discussion of the Dirac equation.

The equations of motion and Gauss' law are the same as in the continuum,
except that finite difference covariant derivatives in space are to be used. One 
may go on to discretize also time, which means that also time derivatives get 
discretized. 

Let me emphasize that the representation theorem constructs a space $\Omega_X$
for every object $X$, but it does not attribute a state $\Psi \in \Omega_X$ to the objects. The
objects have no state. Dynamics consists of structural transformations, not of 
changes of states of objects. This is a big difference to cellular automata. Never- 
theless there is a connection. At the level of effective theories which operate on 
larger scales, the objects can be systems themselves, and so they have internal 
structure. Changes of this internal structure could be interpreted as changes of 
a state of the object. 

It is appropriate to cite also the computer pioneer Konrad Zuse's work on 
"computing space" for similarity in spirit. The frame work which I use here 
to discuss fundamental physics is also employed as a tool in massively parallel 
computing.

Let me clarify that I do not regard values of matter fields $\Psi(X)$ in gauge
field theories as elements of $\Omega_X$ because otherwise it would be impossible to find
a universal equation of motion for the links in which the values of the matter 
fields would enter. This brings us to the next topic. 

4 Universal dynamics 

Next we turn to the time development $t \mapsto S_t$ of a system. Sorin Solomon
proposed to call it "drama". It is supposed to be governed by an equation of
motion. 

According to our guiding principle, the most fundamental equations of mo- 
tion should have the property that they can be formulated purely within the 
frame work provided by the language of thought, without need for any further 
a priori structure. In other words, they should be meaningful for every system
whatever. I will call this a universal dynamics.

Another consideration leads also to universal dynamics: A state should con- 
tain all necessary information about its time development in itself, without need 
for further extrinsic specification. Different kinds of systems should be distin- 
guished by different properties of the initial states. 

Gauge invariance is automatic in a universal dynamics because there is no 
intrinsic way to distinguish between isomorphic representations of a system. 

I will consider dynamics in discrete time. Dynamics in continuous time 
would require some assumptions of a priori structure such as spaces $\Omega_X$ which
are manifolds. 

I will assume at first that the dynamics is of first order, so that the system 
$S_t$ at time $t$ determines the system $S_{t+1}$ at time $t+1$. Generalization to second
order dynamics will be considered later and reduced to the first order case, but 
with two kinds of links. 

In the spirit of the earlier discussion of locality, it is demanded that the
dynamics is local in the following sense:

Every object is a descendant of some object $X$ and every link is a descendant of
some arrow $f$ of the system one time step ago. Descendants of $X$ are determined
by $X$ and by the fundamental arrows of $X$ alone. Descendant links of $f$ are
determined by $f$ if $f$ is fundamental, by source $X$ and target $Y$ of $f$, and
possibly by the fundamental arrows to and from $X$ or $Y$.

The formulation of a dynamics is a rule how a new system is to be made 
out of a given one. It is supposed to have the stated locality properties. The 
possibilities of formulating such rules within the language of thought are very 
restricted. In fact, the innocent looking assumption of a system with a finite 
number of links has introduced a priori structure of countability. I exorcize it 
again by not admitting the possibility of counting the number of links to an 
object. It should make no difference if several simultaneous links from X to Y 
are regarded as one link. 

Basically there are three kinds of change with time, apart from death. 

1. Growth 

2. Motion 

3. Change of composition law

This classification applies not only to material bodies in space, but in this paper 
we are only concerned with physics. 

I speak of growth if there is copying of objects or of links, or if adjoints $b^*$ of
links $b$ are newly produced. There can also be fusion of isomorphic subsystems
under some conditions. The aforementioned reproduction fork dynamics is a 
universal dynamics. It models DNA replication |^ but has also much more 
general copying capabilities. It is a local dynamics which propagates a copy- 
process in such a way that systems of completely arbitrary topology can be 
copied. It is shown in figure ||. For further explanation see ref . . Locality is 
important because enzymes in a biological system act locally. 
Figure 9: Reproduction fork dynamics - a universal copy machine for systems.
A pair of links without adjoints to and from an object X is called a fork. The 
presence of a fork causes X to be copied. The bidirectional links get split to 
become forks and the two halves are divided among the copies of X. The links 
which had no adjoint before get one. Once a copy process is started at some 
initial object $X_0$, the forks travel through the whole system and one gets two
copies of the system as a result. (The dotted arrows are only there to indicate 
the fact that the objects are copies of each other.) This works for systems of 
completely arbitrary topology. 

In principle there exists the possibility of a change in the composition law 
$f \circ g \mapsto f \circ s \circ g$, where $s$ is a loop. But I will not enter into a discussion of this
possibility here. 

Here I will be chiefly interested in motion. It consists of changes of arrows.^
This includes changes of relations of an object X to itself; these relations could 
be regarded as properties of X. 

The possibilities are very limited. How can a link $b' : X \mapsto Y$ of the system
at time $t + 1$ arise? It can only have been composed from links of the system
$S_t$ at time $t$. (Creating new links, e.g. by taking adjoints, would be regarded
as growth.) But all that can be made by composing links is an arrow of the
category. So the rule has the form

$b' = f$ (8)

where $f$ is a possibly composite arrow of $S_t$. But this means that the category
does not change at all.** The only change is in the specification which arrows are
considered as fundamental. Motion means that composite relations are declared
fundamental. One can think of it as composition of links or bonds by objects
which act as catalysts in a manner which is familiar from chemistry, see figure 10.

Such catalysis of relations also plays a basic role in Spinoza's famous treatise 
on ethics, Ethica, ordine geometrico demonstrata p^ . 

The interpretation of the motion of a point particle in space is shown
schematically in figure 11.

Let us return to the analysis of the possibilities for $f$ in the general formula
(8) for motion.

^As I mentioned before, the objects are secondary. This is so because they can be recovered 
from the arrows and the composition law according to the representation theorem. This is true 
up to isomorphism. Isomorphic systems are not regarded as different. There is no intrinsic 
way of distinguishing between them. 

**This is reminiscent of Parmenides, the Greek father of ontology. He held that nothing can
appear or disappear in the world because this would contradict the nonexistence of the
nonexistent. Such changes are only apparent ones to man.
Figure 10: Catalysis in chemistry. Enzyme E binds molecules A and B. First a
substrate-enzyme complex is formed where A and B are bound to E. Then the
composite link between A and B is transformed into a fundamental one.




Figure 11: Interpretation of motion as transformation of a composite relation 
into a fundamental one. The objects which are connected by bidirectional links 
are interpreted as space points, and the other object as a particle (or as "the 
idea of matter"). The link from the particle to a space point x represents the 
relation of "being at x" . Motion takes place when a composite arrow made from 
the relation b of the particle to its former position, and a relation of this space 
point to a neighbor is declared fundamental, while b loses this status, b remains 
in the category as a composite arrow. 
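
The bookkeeping behind this picture can be made explicit. The following is a minimal sketch, assuming a hypothetical ring of four space points and writing every arrow as a word in the fundamental links of the initial system, listed in the order in which they are traversed. A move replaces the fundamental link of the particle by a composite arrow, so the word only grows by links that were already present.

```python
N = 4  # hypothetical ring of space points 0..3; link "s{x}" leads from x to x+1

def move(position):
    """Declare the composite of 'being at x' and the link x -> x+1 fundamental."""
    word, x = position
    return (word + (f"s{x}",), (x + 1) % N)

def is_arrow_of_initial_category(word):
    """True if the word is 'at0' followed by consecutive space links from point 0."""
    if not word or word[0] != "at0":
        return False
    return all(link == f"s{k % N}" for k, link in enumerate(word[1:]))

# At time 0 the particle is at point 0 through the fundamental link "at0".
position = (("at0",), 0)
for _ in range(5):
    position = move(position)
```

After five steps the particle sits at point 1, and its link is still an arrow of the category of the initial system.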

The links in $f$ are restricted by the locality demand. The simplest possibility
is as follows.

$b \mapsto b \circ b^* \circ b$. (9)

This is reminiscent of Hegel's dialectic process. Let us follow Hegel in speaking of a
"Denkbestimmung" in place of a link. Then the process gets verbally described
as follows. A Denkbestimmung (thesis) combines with its opposite (antithesis)
to form a new, "more advanced" Denkbestimmung (synthesis). Actually something
new is obtained only if $b^*$ is not the inverse of $b$.
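
The last remark can be seen in the simplest possible representation. This is a minimal sketch, assuming links are represented by complex numbers (1 x 1 matrices) with the adjoint given by complex conjugation; the values are made up.

```python
def dialectic_step(b: complex) -> complex:
    """One step b -> b o b* o b, with the adjoint taken as the complex conjugate."""
    return b * b.conjugate() * b

unitary = complex(0.6, 0.8)    # |b| = 1, so b* is the inverse of b
stretched = complex(2.0, 0.0)  # |b| = 2, so b* is not the inverse of b

unchanged = dialectic_step(unitary)
changed = dialectic_step(stretched)
```

A unitary link is a fixed point of the step, while the non-unitary one is mapped to a different link.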

The dialectic process is truly a universal dynamics in the sense of the above 
definition. But it is not the only one. Neither Maxwell's equations nor Einstein's 
are of this form. 

Other possibilities are found by making use of links $b_i$ to or from $X$ or $Y$
other than the original link $b : X \mapsto Y$ and its adjoint $b^*$. Assuming, contrary
to Hegel, that $b^* \circ b$ constructions yield nothing nontrivial, the useful links can
only occur in triangles $\Delta_i$ which contain $b$ or $b^*$. The triangular paths have the
form

$\Delta_i = b^* \circ b_i^2 \circ b_i^1$ (10)

or the adjoint of that. This involves links $b_i^1 : X \mapsto Z$, $b_i^2 : Z \mapsto Y$, $(Z \neq X, Y)$.
Different triangles cannot be distinguished in an intrinsic way. Therefore they
will have to appear symmetrically in the rule. Also we have no way of adding
contributions. The only possible composition is with $\circ$. This leaves us with the
possibilities

$b \mapsto b \circ \Delta_1 \circ \dots \circ \Delta_n$ (11)

or

$b \mapsto \Delta_1^* \circ \dots \circ \Delta_n^* \circ b$ (12)

where $\Delta_1 \dots \Delta_n$ are all the different triangular paths of the form (10).

This is the universal equation of motion of fundamental physics, modulo 
some complications which we will discuss below. 

The path on the right hand side of eq. (11) starts with a factor $b \circ b^*$ which
could be omitted.

A schematic graphical representation of eq. (11) is given in the corresponding figure.
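
For an Abelian representation the rule $b \mapsto b \circ \Delta_1 \circ \dots \circ \Delta_n$, with each triangle of the form $b^* \circ b_i^2 \circ b_i^1$, can be evaluated directly. The following is a minimal sketch, assuming links are represented by complex phases and the adjoint by complex conjugation; the phases and the function name `universal_step` are made up for illustration.

```python
import cmath

def universal_step(b, triangle_paths):
    """b -> b o Delta_1 o ... o Delta_n for commuting phases.

    Each entry of triangle_paths is a pair (b1, b2) with b1 : X -> Z_i and
    b2 : Z_i -> Y, so that Delta_i = b* o b2 o b1 is a loop at X.
    """
    new_b = b
    for b1, b2 in triangle_paths:
        delta = b.conjugate() * b2 * b1
        new_b = new_b * delta
    return new_b

b = cmath.exp(0.2j)
paths = [
    (cmath.exp(0.10j), cmath.exp(0.40j)),  # X -> Z_1 -> Y, total phase 0.5
    (cmath.exp(0.25j), cmath.exp(0.05j)),  # X -> Z_2 -> Y, total phase 0.3
]
new_b = universal_step(b, paths)
```

Each triangle contributes the phase difference between the detour and the direct link, here 0.3 and 0.1, so the new link carries the phase 0.6.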

4.1 Universal conservation law 

In the absence of growth processes, a universal equation of motion implies
substitutions $b \mapsto f$ which replace links by arrows which existed before in the
category. Therefore the category $\mathrm{Cat}(S)$ does not change in the course of time.
Quantities Q which are determined by the isomorphism class of the category 
are conserved. In particular the gauge group is time independent. 

If there are growth processes, new objects which are copies of old ones may 
appear, and also new links which are copies or adjoints of old ones. In this case 
the conservation laws are more subtle. 



5 Maxwell's equations 

In general, a system is identified as space by the validity of certain constraints. 
This will be discussed below when we come to general relativity. In this section, 
I am not interested in this aspect and I will assume that we know already what 
is a discretized flat space. Let us think of a triangular lattice like figure |l|. 

Maxwell's equations come in two groups. The first group states
restrictions on the initial state. The equation $\mathrm{div}\,\mathbf{B} = 0$ is automatically satisfied
through the introduction of a vector potential, and there remains only Gauss'
law, $\mathrm{div}\,\mathbf{E} = \rho$. Gauss' law in our language is shown in the corresponding figure.

The second group contains equations of motion. Their universal version is 
shown in figure (^). 

The Yang Mills equations have exactly the same form, apart from an ordering 
problem which will be discussed. But the gauge group - which is a property of 
the initial state - is different. The equations retain their form in the presence of 
Dirac matter fields, but in this case one of the points in the diagram represents 
a flavor space (or point at infinity) rather than a space point. 

I emphasize that these equations are generally meaningful, but they reduce 
to Maxwell's or Yang Mills equations only on a regular "flat" lattice. I do not 
know a universal version of the Maxwell or Yang Mills equations on curved 
space. 
The gauge group is part of the data which specify which particular aspect
of the world we are dealing with. In electromagnetism, the gauge group is
isomorphic to $U(1)$. This group admits a natural parameterization by real
numbers $0 \dots 2\pi$. The magnetic field is an element of the gauge group (parallel
transporter around a loop) and so it acquires a numerical status. Similarly the 
electric field is a loop which involves parallel transport forth and back at two 
successive times. In this way they both become "quantities" in spite of the fact 
that the general frame work knows no numbers. They are gauge invariant and 
therefore observable in principle. 

If the gauge group is noncommutative, there arises an ordering problem. In 
what order shall the triangles be traversed? The only reasonable answer is "at 
random". This introduces some stochasticity which may be thought to be a 
remainder from the quantum theory. In the formal continuum limit its effect 
disappears. 
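
The ordering problem is easy to exhibit. This is a minimal sketch, assuming two made-up loops represented by 2 x 2 integer matrices of determinant 1; the seeded random generator only stands in for the "at random" prescription and is not part of the text.

```python
import random

def matmul(a, b):
    """Product of two 2 x 2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

IDENTITY = [[1, 0], [0, 1]]

def compose_in_random_order(loops, rng):
    """Compose the given loops in an order drawn at random."""
    order = list(loops)
    rng.shuffle(order)
    result = IDENTITY
    for loop in order:
        result = matmul(result, loop)
    return result

d1 = [[1, 1], [0, 1]]
d2 = [[1, 0], [1, 1]]
result = compose_in_random_order([d1, d2], random.Random(0))
```

The two possible orders give different products, so for a noncommutative group the outcome of a step depends on the draw.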

In the quantum theory, the superposition principle furnishes a commutative 
operator +, and we can sum over triangles instead of composing triangles with 
the help of o. So the above ordering problem disappears, but in its place we 
have the "ordering problems" of quantum mechanics. A universal Schrodinger 
equation is described in and in section |l^ below. 

6 General relativity 

Next I turn to the equations of motion and constraints of general relativity. 
The constraints are the properties which a system should have in order to be 
interpreted as space (in the sense of space-like surface in space time). Among 
the constraints is the selection of the gauge group $SL(2, \mathbf{C})$. In 2+1 dimensions
it would be the covering group $SL(2, \mathbf{R})$ of the 3-dimensional Lorentz group
instead.

Also among the constraints is the existence of an invariant trace tr which 
maps loops to numbers. 

The remaining constraints are shown in figure ^, and the equations of motion 
in figure ^. They come from the canonical formulation of general relativity 
in Ashtekar variables. For the reader's convenience a brief review is given in
Appendix B. 

There exists also a version of the constraints which does not involve tr, cp. 
[ p^ . It is only equivalent under invertibility conditions on the dreibein. A 
similar reformulation exists in 2+1 dimensions, where it takes the very simple
form of a flatness requirement $F_{ij}(x) = 0$. In our language this is a statement
about thin links $b_i$. It says that thin loops around triangles equal the
identity.

7 Interpretation of the equations of motion 
7.1 Syntax and Semantics: Classification of initial states



The graphical form of the Maxwell, Yang Mills, and Einstein-Ashtekar equations
could have been written down by pre-Socratic philosophers. They must surely
have conceived of the idea that the world evolves by rearrangement of triangles. 
But they could not possibly have found the proper interpretation. 

And some readers will no doubt be left in a state of perplexity by the claim
that the universal equations of motion contain the Maxwell and Yang Mills
equations. The precise correspondence will be explained. But the perplexity
itself is worth a comment. It makes it clear that the problem of describing 
nature and its laws is not solved yet by stating laws in the form of equations. 
These equations live on a purely spiritual level. They are syntactical rules. 
In addition one will ask for their meaning. This is the question of semantics. 
It asks how entities in the equations correspond with phenomena which can 
be observed in nature. Physicists say the theory itself must determine what 
quantities are observable. The question about meaning is therefore divided into 
two questions. The first question is what quantities from the equations can be
observed in principle. The second question is where to find these quantities in 
nature. 

The first question has already been answered. Observations give answers 
yes or no to questions whether certain statements about a system are true. The 
permissible statements are those which can be formulated in the language of 
thought. It was pointed out that this implies that the observable quantities are 
gauge invariant. 

The second question is rarely considered in practice. Typically equations in 
physics are written down in order to explain certain phenomena. In this case 
the observable quantities have already been fixed a priori by the scope of the 
investigation. In the present frame work the question is tied to the question how 
systems specified as initial states identify themselves, e.g. as electromagnetic
fields to which Maxwell's equations of motion would apply, or as space (= space-like
hyperplane in space time) to which the equations of motion of general
relativity would apply. The general answer is this. They identify themselves 
by properties which can be formulated in the language of thought, and which 
are preserved (in their totality) by the time development. Physicists call them 
constraints. Semantics demands therefore that systems with these properties 
are given names in natural language. 

The question may arise whether all possible such properties will also occur in 
nature. The practical answer is that not all possible properties can be expected 
to give rise to phenomena that can be observed at scales which are very large 
compared to scales set by the equations. Only properties which give rise to 
emergent behavior can be observed in practice. This shows that fundamental 
physics is part of complex systems theory ^ . 

In electrodynamics and Yang Mills theory, the familiar constraint is Gauss' 
law. It can be formulated in the present frame work in the form of statements 
"loop = identity" . The result is shown in schematic form in figure ^. 

In addition there are further properties which assert the existence of
invariants. The properties of this type are linearity of the state spaces $\Omega_X$ and
of the maps $f$ between these state spaces (links $b$ = lattice gauge fields), and
the identification of the gauge group $G$ and of $\Omega_X$ as a representation space
of it. Linearity amounts to the existence of invariant operators for addition,
$+ : \Omega_X \times \Omega_X \mapsto \Omega_X$, and for multiplication with numbers, $\cdot : \mathbf{C} \times \Omega_X \mapsto \Omega_X$,
with associativity, commutativity and distributivity properties. [The definition
of the gauge group was described before.] The defining property of invariants
is that their parallel transport is path independent. As a result they can be 
globally defined in such a way that they commute with parallel transport. 

The gauge group in electrodynamics is Abelian. This amounts to the
statement

$s \circ s' = s' \circ s$

for arbitrary loops $s, s' : X \mapsto X$.

The constraints in general relativity include the Gauss law. In addition there 
are further constraints. Their statement involves an invariant tr whose existence 
is also one of the constraints. It maps loops to numbers. The further constraints 
are shown in figures ^ The equations of motion of general relativity are shown 
in figure ^. 

All of this discussion is at the level of formal discretizations of the standard 
theories. 

7.2 Interpretation of the universal form of Maxwell's equations

Maxwell's equations of motion are of second order once the vector potential is 
introduced. Therefore the initial state needs to specify both coordinates and 
velocities or momenta. As a result, there will be two kinds of links. They will
be printed thin and fat, respectively. One could try to distinguish them by 
properties in an intrinsic way, e.g. by postulating that one kind has an adjoint 
and the other does not. 

In the continuum, the canonical variables are vector potential $\mathbf{A}(x)$ and
electric field $\mathbf{E}(x)$, and the Maxwell equations are

$\dot{\mathbf{A}} = -\mathbf{E}$ (13)
$\dot{\mathbf{E}} = \mathrm{curl}\,\mathbf{B}$ (14)
$\mathbf{B} = \mathrm{curl}\,\mathbf{A}$. (15)

On the lattice one uses instead exponentiated quantities. We may label the links
with target $x$ in some way by $i = \pm 1, \pm 2, \dots$. They have a direction and a length
given by a vector $\mathbf{i}$. Choosing $x$ and $i$ will select a link $b_i(x)$. The opposite
link to $b_i(x)$ is denoted $b_{-i}(x + \mathbf{i})$. Let $A_i(x)$ and $E^i(x)$ be the components of
$\mathbf{A}$ and $\mathbf{E}$ in the directions of the links.

The thin links will be the parallel transporters of lattice gauge theory. As- 
suming a vector potential which is smooth on the scale of the lattice spacing, 
the parallel transporters can be approximated,

$U(b) = P\exp\left(-\int_b A_\mu dx^\mu\right) \approx \exp\left(-A_i(x + \dots)\right)$ for $b = b_i(x)$. (16)

The fat links are taken to be

$P(b) = U(b)\exp\left(-\tau E^i(x)\right)$, (17)

where $\tau$ is a discrete time step. $U(b)$ and $P(b)$ are complex numbers of modulus
1; the vector potential and the electric and magnetic field are regarded as pure
imaginary (otherwise a factor $i$ has to be put in the exponents). The $E$-variables
on opposite links are related by

$\exp\left(-\tau E^{-i}(x + \mathbf{i})\right) U(b) = U(b)\exp\left(\tau E^i(x)\right)$ (18)

These quantities will be functions of a discrete time t . It follows from 
eqs.(|l|) and (|l|) that 

$U_{t+\tau}(b) = U_t(b)\exp\left(\tau E^i(x)\right)$ (19)

$= U_t(b)\,P_t(-b)\,U_t(b)$, (20)

which is the second of the universal Maxwell equations in figure ||. 
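
The step from the definitions to the update rule can be checked numerically. This is a minimal sketch for a single link, assuming the conventions of this section as read here: pure imaginary $A$ and $E$, a thin link $U = \exp(-A)$, a fat link $P = U\exp(-\tau E)$, the opposite thin link equal to $U^{-1}$, and the field on the opposite link fixed by the relation between opposite links, which for commuting numbers makes it $-E$. All numerical values are made up.

```python
import cmath

tau = 0.1             # discrete time step (illustrative value)
A = 0.8j              # link component of the vector potential, pure imaginary
E = 0.3j              # link component of the electric field, pure imaginary

U = cmath.exp(-A)                    # thin link b
P = U * cmath.exp(-tau * E)          # fat link b
E_opposite = -E                      # solves exp(-tau E_opp) U = U exp(tau E)
U_opposite = 1 / U                   # thin link in the opposite direction
P_opposite = U_opposite * cmath.exp(-tau * E_opposite)  # fat link -b

# Universal form of the update: thin link, opposite fat link, thin link.
U_next = U * P_opposite * U

# Direct update of the potential by one step of dA/dt = -E.
U_expected = cmath.exp(-(A - tau * E))
```

Both ways of advancing the thin link agree, and the new link is again a complex number of modulus 1.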

Consider now the triangles $\Delta$ in the first equation of the same figure. In the limit of
small lattice spacing, path ordering can be neglected and the parallel transporter
around a triangle is given by the magnetic flux $\Phi_\Delta$ through the triangle,

$U(\Delta) = P\exp\left(-\oint_{\partial\Delta} A\,dx\right) = \exp(-\Phi_\Delta)$ (21)
$\Phi_\Delta \approx B^\perp \cdot (\text{area of } \Delta)$ (22)

where $B^\perp$ is the magnetic field perpendicular to the triangle. The two triangles
$\Delta_1$ and $\Delta_2$ on opposite sides of the link in the figure have opposite orientation.
Therefore the factors $\exp(-\Phi_{\Delta_i})$ will cancel, except for the effect of the change
of the magnetic field component $B^\perp$ in the direction perpendicular to the link
and to $B^\perp$. This change is part of the component of $\mathrm{curl}\,\mathbf{B}$ in the direction of
the link (all of it in 2 space dimensions). Taking the product over the pairs of
triangles in all directions perpendicular to the link, one gets $\exp(-\gamma\,\mathbf{i} \cdot \mathrm{curl}\,\mathbf{B}_t)$,
where $\gamma$ is a dimensionful constant of geometric origin. The right hand side of
the equation is this product multiplied with $U_t(b)$. So the equation reads

$P_{t+\tau} = U_t \exp(-\gamma\,\mathbf{i} \cdot \mathrm{curl}\,\mathbf{B}_t)$. (23)

For suitable choice of the time step, $\tau = \sqrt{\gamma}$, this is an exponentiated form of
the Maxwell equation (14), because

$P_{t+\tau} = U_{t+\tau}\exp\left(-\tau E^i_{t+\tau}\right)$ (24)
$= U_t \exp\left(\tau E^i_t\right)\exp\left(-\tau E^i_{t+\tau}\right)$ (25)
$= U_t \exp\left(-\tau^2 \dot{E}^i_t\right)$ (26)

for small $\tau$, by definition of $P$ and the equation of motion for $U_t$. This completes
the discussion of the Maxwell equations.

For future reference, we rewrite eq. (21) in terms of the curvature tensor $F_{ij}$.

The smoothness assumption implies that the relation to continuum
quantities [temporarily distinguished by greek indices] is as follows:

$A_i(x) = i^\mu A_\mu(x) + O(a^2)$ (27)
$F_{ij}(x) = i^\mu j^\nu F_{\mu\nu}(x) + O(a^3)$ (28)

Smoothness of $A$ requires that a suitable gauge is chosen locally.
The discretization (16) preserves the properties

$U(-b) = U(b)^{-1}$ (29)
$U(\Delta) = \exp\left(-\tfrac{1}{2}F_{ij}(x) + \dots\right)$ (30)
$\mathrm{tr}\,U(\Delta) = 1 + \tfrac{1}{8}\mathrm{tr}\,F_{ij}(x)^2 + \dots$ (31)

for a triangle $\Delta$ with corners $x$, $x + \mathbf{i}$, $x + \mathbf{j}$. To see this one computes from the
definition of the parallel transporters $U(\Delta) = 1 - \tfrac{1}{2}\left(\partial_i A_j - \partial_j A_i + [A_i, A_j]\right)(x) + O(a^3)$.
It follows that eq. (30) holds with dots representing terms which are of
order $a^3$ and traceless. Eq. (31) follows from this. All this is familiar from lattice
gauge theory on a lattice of lattice spacing $a$.

The parallel transporters $U(b)$ represent the thin links $b$. In the case of
general relativity, the vector potentials and the parallel transporters will be
denoted by $\omega_i$, $u$ in place of $A_i$, $U$.



7.3 General relativity in discrete time 

In Ashtekar's canonical formulation of general relativity in the continuum, the 
canonically conjugate variables (in the spinorial formulation) are as follows. 

There is a vector potential $\omega_i(x) \in sl(2, \mathbf{C})$ which governs parallel transport
of spinors along paths on the space-like hyper-surface. The canonical conjugate
to it is a spinorial version of the dreibein density, $2e^i(x) \in sl(2, \mathbf{C})$. It is a
traceless hermitian $2 \times 2$ matrix. It is the analog of the electric field in Yang
Mills theory.

The equations of motion are

$2\dot\omega_k = \underline{N}\,[F_{kj}, e^j]$, (32)
$2\dot e^i = \underline{N}\,D_j[e^i, e^j]$ (33)

The lapse function $\underline{N}$ can be chosen arbitrarily. $D_j$ is the $sl(2, \mathbf{C})$-covariant
derivative.
The constraints are as follows. There is the Gauss constraint which is the 
same as in Yang Mills theory, 

$D_i e^i = 0$, (34)

In addition the following scalar and vector constraints hold

$H = \mathrm{tr}\,(F_{ij}e^i e^j) = 0$, (35)
$H_i = \mathrm{tr}\,(F_{ij}e^j) = 0$, (36)

(37) 

The second equation of motion can be rewritten with the help of the Gauss 
constraint as follows 

$2\dot e^i = -\underline{N}\,[e^j, D_j e^i]$ (38)

This brings about the dilemma which of the two versions to choose as a candidate
precursor for a universal formulation. I choose eq. (33) because it has a more
natural geometric interpretation. If $\epsilon_{ijk} = \pm 1$ as usual, depending on whether
$ijk$ is an even or odd permutation of 1,2,3, and if one defines

then the right hand side of eq. (33) is expressible as a covariant total derivative
of $f$. This parallels the situation in the Maxwell equations, where $\mathrm{curl}\,\mathbf{B}$ can
also be regarded as total derivative of a 1-form $\epsilon_{ijk}B_{jk}dx^i$.

A certain combination of the Gauss and vector constraint is supposed to 
ensure the diffeomorphism invariance of the theory. It is a special property 
of space, not of the universal dynamics which is supposed to be much more 
generally applicable. The equations which govern the universal dynamics have 
a different kind of coordinate independence. 

There exists in the literature a lattice formulation of general relativity in 
Ashtekar variables which is very similar to lattice gauge theory . In the case 
of the 2+1-dimensional theory there is a fully consistent discrete formulation of 
quantum gravity due to H. Waelbroeck j2^. It uses the fact that the scalar and 
vector constraints in 2+1 dimensions are equivalent modulo invertibility of the 
dreibein to 

$F_{ij}(x) = 0$.

In the lattice formulation, the vector potential gets replaced by parallel
transporters $u(b) \in SL(2, \mathbf{C})$. Apart from the gauge group this is much as in the
case of Yang Mills theory. The parallel transporters are assigned to links b of 
the lattice. 

In the continuous time formulation, the dreibein on the lattice remains an el- 
ement of the Lie algebra. It is also assigned to links of the lattice (more precisely 
to one of their endpoints; the dreibein transform under gauge transformations 
like matter fields in the adjoint representation which sit on sites x). This is 
very similar to the electric field in the Hamiltonian formulation of lattice gauge 
theory ||l|. 

Let us again label the links $b = b_i(x)$ which enter a point $x$ of the lattice by $i$.
The link in opposite direction shall be $b_{-i}(x + \mathbf{i})$. Henceforth, labels $i, j, k$ shall
refer to links of the lattice. To avoid confusion, space indices will be labeled by
$\mu, \nu, \dots$ when they are needed. The basic variables are then $u(b_i(x)) = u_i(x) \in
SL(2, \mathbf{C})$ and $e^i(x) \in sl(2, \mathbf{C})$.
A continuous time formulation needs much a priori structure; it seems
impossible to formulate it without assuming at least that our spaces $\Omega_X$ are
manifolds. Because of this, time will also be discretized, and the dreibein will be
exponentiated to

$\mathcal{E}^i(x) = \exp(-\tau e^i(x))$ (39)

$\tau$ will be related to the time step. As a consequence of corresponding properties
of the discretized dreibein variables, the $\mathcal{E}$-variables on adjoint links are
related by

$u_i(x)\,\mathcal{E}^i(x) = \mathcal{E}^{-i}(x + \mathbf{i})\,u_i(x)$. (40)

The Gauss law reads (to leading order in $\tau$)

$\prod_i \mathcal{E}^i(x) = 1$

In the graphical representation the thin links represent parallel transporters
$u(b) = u_i(x)$ and the fat links are given by

$p(b) = p_i(x) = u_i(x)\,\mathcal{E}^i(x)$ (41)

They obey the unitarity relation $p(b^*) = p(b)^{-1}$. The $\mathcal{E}$-variables get represented
as hair pins, cp. the left hand side of the second equation in the corresponding figure,
and the Gauss law takes again the graphical form shown for it before.

In this approach the geometry is not in the lattice but in the parallel trans- 
porters on the links of the lattice. For simplicity I will assume that charts in 
the continuum which cover a sufficiently small neighborhood of a point can be 
represented by a regular triangular mesh, as in figure |l|. In 3 space dimensions, 
a dense sphere packing can be used. It is assumed that the continuum vector 
potential is smooth on the scale of the mesh. This introduces a small parameter
$\epsilon$.

To justify the discretized version of the equation of motion for the dreibein, the
familiar relation between the group theoretical commutator $xyx^{-1}y^{-1}$ and the
Lie algebra commutator $[X, Y]$ is used. It yields

$\mathcal{E}^j(x + \mathbf{j})\,\mathcal{E}^k(x + \mathbf{j})\,\mathcal{E}^j(x + \mathbf{j})^{-1}\,\mathcal{E}^k(x + \mathbf{j})^{-1} = 1 + \tau^2[e^j(x + \mathbf{j}), e^k(x + \mathbf{j})]$ (42)

The left hand side is represented by a composition of four hair-pins. They can
be seen in the second half of the corresponding figure, except that the initial and terminal thin lines are
missing. The links of the triangle in the figure are $b_j(x)$, $b_k(x + \mathbf{j})$, $b_{-i}(x + \mathbf{i})$.
The four hair-pins make a loop from $x + \mathbf{j}$ to $x + \mathbf{j}$. To be able to compose these
loops for different triangles with the same side $b_i(x)$, parallel transport from
$x + \mathbf{j}$ to $x$ is needed. There are two paths (with one or two links) to choose
from. The difference does not matter because its effect is of higher order in $\epsilon$.
The choice is made so as to minimize the total number of links in the path.
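
The relation between the two commutators is easy to verify numerically. This is a minimal sketch, assuming two made-up traceless hermitian generators (the first two Pauli matrices) and a small value of $\tau$; the matrix exponential is computed by a truncated power series so that no external library is needed.

```python
def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def add(a, b):
    return [[a[i][j] + b[i][j] for j in range(2)] for i in range(2)]

def scale(c, a):
    return [[c * a[i][j] for j in range(2)] for i in range(2)]

IDENTITY = [[1, 0], [0, 1]]

def expm(m, terms=20):
    """Matrix exponential by its power series, adequate for small matrices."""
    result, term = IDENTITY, IDENTITY
    for n in range(1, terms):
        term = scale(1.0 / n, matmul(term, m))
        result = add(result, term)
    return result

X = [[0, 1], [1, 0]]       # made-up Lie algebra element
Y = [[0, -1j], [1j, 0]]    # made-up Lie algebra element
tau = 1e-3

x, y = expm(scale(-tau, X)), expm(scale(-tau, Y))
x_inv, y_inv = expm(scale(tau, X)), expm(scale(tau, Y))

group_commutator = matmul(matmul(x, y), matmul(x_inv, y_inv))
lie_commutator = add(matmul(X, Y), scale(-1, matmul(Y, X)))
approximation = add(IDENTITY, scale(tau ** 2, lie_commutator))

error = max(abs(group_commutator[i][j] - approximation[i][j])
            for i in range(2) for j in range(2))
```

The deviation is of third order in $\tau$, far below the second order term that carries the Lie algebra commutator.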

The rest of the argument goes as in the case of the Maxwell equation. The 
contribution from triangles on opposite sides of the link bi{x) will cancel out 
except for a contribution proportional to the covariant divergence. 
The other equation of motion and the constraints involve the curvature.

There is an equation like eq. (22) which relates the parallel transporter around a
triangle to the magnetic field, except that now the analog of the magnetic field
is the curvature of the $SL(2, \mathbf{C})$-connection on the space-like hyper-surface.
Consider

Mfc,t+r = "fc,t(l - Tt^fc + ...) (43) 

Uk is represented by the horizontal thin lines in figure ^ 
The bitriangle in the first equation of figure || is 

Ai = u{Ar\,{xrp,{x)u{A) (44) 
u(A)-i(l-Te^(x) + ...)u(A) (45) 

= l + I[e-'"(x),Ffe,] + ... (46) 

$$= (1 - \tau\dot\omega_k + \dots) \qquad (47)$$

by eq.(??) and Einstein equation (??). Comparing, we see that the figure reproduces
the time evolution (43).



8 The Dirac equation 

The content of this section is based on joint work with B. Holm and D. Lübbert
[|9|. 

The massless Dirac or Weyl equation in flat 4-dimensional space time reads

$$-i\hbar\dot\psi = i\hbar c\,\alpha^i\partial_i\psi = \hbar c\,D\psi \qquad (48)$$

with matrices $\alpha^i$ which obey the standard anti-commutation relations

$$\{\alpha^i, \alpha^j\}_+ = 2\delta^{ij} \quad (i,j = 1\dots3) \qquad (49)$$

It follows from these relations that $D$ is a square root of the negative Laplacian
$-\Delta$.

The appropriate inverse dreibein for a flat space is $e_a^{\ i} = \delta_a^i$ with $e =
\det(e^a_{\ i}) = 1$. It follows that the Ashtekar dreibein-variable in flat space

$$\alpha^i = e^i = e\,e_a^{\ i}\sigma^a \qquad (50)$$

obeys the anti-commutation relations (49).

Let us denote by $-i$ the opposite direction to $i$; $\partial_{-i} = -\partial_i$. There is no distinct
positive direction, and $\alpha^i\partial_i$ should have a meaning independent of choices
of positive directions. Therefore one should set

$$\alpha^{-i} = -\alpha^i . \qquad (51)$$

The anti-commutation relations can now be written as

$$\alpha^i\alpha^j = -\alpha^j\alpha^i \quad (i \neq \pm j) , \qquad (52)$$
$$\alpha^{-i}\alpha^i = -1 , \qquad (53)$$

and $\alpha^{i\,2} = \alpha^i\alpha^i = 1$. We see here $-$signs which are of crucial importance,
especially the second one. The product is a product of matrices.
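
The relations (49) and (51) to (53) can be verified directly for a concrete choice of matrices. The sketch below assumes the Pauli matrices for $\alpha^1, \alpha^2, \alpha^3$ (an illustrative choice, as are all function names). It also checks the statement that $D$ squares to the negative Laplacian: on a plane wave $e^{ik\cdot x}$ the operator $D = i\alpha^i\partial_i$ acts as $-\alpha^i k_i$, so $D^2$ should act as $|k|^2$, the symbol of $-\Delta$.

```python
# Check of the algebra (49), (51)-(53) with the Pauli matrices as an
# assumed, illustrative choice for alpha^1, alpha^2, alpha^3.

IDENTITY = [[1, 0], [0, 1]]


def matmul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def scaled(a, s):
    """Return s * a."""
    return [[s * entry for entry in row] for row in a]


def close(a, b, tol=1e-12):
    """Entrywise comparison of two 2x2 matrices."""
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(2) for j in range(2))


ALPHA = {
    1: [[0, 1], [1, 0]],
    2: [[0, -1j], [1j, 0]],
    3: [[1, 0], [0, -1]],
}
for i in (1, 2, 3):
    ALPHA[-i] = scaled(ALPHA[i], -1)        # eq.(51): alpha^{-i} = -alpha^{i}


def relations_hold():
    """True if (52), (53) and alpha^i alpha^i = 1 hold for all signed directions."""
    for i in ALPHA:
        for j in ALPHA:
            ij = matmul(ALPHA[i], ALPHA[j])
            ji = matmul(ALPHA[j], ALPHA[i])
            if abs(i) != abs(j):
                ok = close(ij, scaled(ji, -1))          # eq.(52)
            elif i == -j:
                ok = close(ij, scaled(IDENTITY, -1))    # eq.(53)
            else:
                ok = close(ij, IDENTITY)                # alpha^i alpha^i = 1
            if not ok:
                return False
    return True


def symbol_squared(k):
    """(sum_i alpha^i k_i)^2 for a wave vector k = (k_1, k_2, k_3)."""
    m = [[0, 0], [0, 0]]
    for i in (1, 2, 3):
        m = [[m[r][c] + k[i - 1] * ALPHA[i][r][c] for c in range(2)]
             for r in range(2)]
    return matmul(m, m)
```

For a wave vector such as `(1.0, 2.0, -0.5)` the squared symbol is $|k|^2$ times the identity, here $5.25$.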

In the Kogut Susskind discretization of the Dirac equation, a cubic lattice
with lattice sites $x$ substitutes for continuous 3-space. In place of the Dirac
$\alpha$-matrices one has numbers $\eta^i(x)$ attached to links $(x+i, x)$ between nearest
neighbors $x$ and $x+i$ on the lattice ($i$ is the lattice vector in $i$-direction). In
place of the Dirac algebra (52) one has the relations

$$\eta^i(x+j)\,\eta^j(x) = -\eta^j(x+i)\,\eta^i(x) \quad (i \neq \pm j) , \qquad (54)$$
$$\eta^{-i}(x+i)\,\eta^i(x) = -1 . \qquad (55)$$

Usually one requires in addition

$$\eta^{i\,2}(x) \equiv \eta^i(x+i)\,\eta^i(x) = 1 . \qquad (56)$$

But this relation can be abandoned. The $\eta$'s sit on links. We may think of them
as parallel transporters of an (external) gauge field. (In the Kogut Susskind
formalism the gauge group is $\mathbf{Z}_2$.) Validity of eq.(56) can then be assured by a
gauge transformation. This is seen as follows.

It follows from equations (54) that

$$\eta^{i\,2}(x+2j)\,\eta^{j\,2}(x) = \eta^{j\,2}(x+2i)\,\eta^{i\,2}(x) \qquad (57)$$

for all $i, j$. We may restrict attention to a sub-lattice of twice the lattice spacing,
with parallel transporters $\eta^{i\,2}$. Eq.(57) tells us that $\eta^{i\,2}$ is a pure gauge. Therefore
it can be gauged away by a gauge transformation

$$\psi(x) \mapsto \gamma(x)\psi(x) , \qquad \gamma(x) \in \mathbf{C} .$$

If the $\eta$'s are either numbers $\pm1$ or $\pm i$ then $\gamma(x)$ can be chosen in $\mathbf{Z}_2$.
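
As an illustration, the relations (54) to (56) can be checked for one familiar solution. The sketch below assumes the standard staggered phases $\eta^i(x) = (-1)^{x_1 + \dots + x_{i-1}}$ for positive directions; this specific choice, the sign convention for negative directions (fixed by solving eq.(55) for $\eta^{-i}$) and the function names are assumptions made here, not something the text prescribes.

```python
from itertools import product

DIRECTIONS = (1, 2, 3, -1, -2, -3)


def shift(x, i):
    """Lattice site x + i, where i = +-1, +-2, +-3 labels a signed direction."""
    y = list(x)
    y[abs(i) - 1] += 1 if i > 0 else -1
    return tuple(y)


def eta(i, x):
    """Phase on the link from x to x + i.

    Positive i: assumed standard staggered phase (-1)^(x_1 + ... + x_{i-1}).
    Negative i: fixed by eq.(55), eta^{-i}(x + i) = -1 / eta^{i}(x).
    """
    if i > 0:
        return (-1) ** (sum(x[: i - 1]) % 2)
    return -eta(-i, shift(x, i))


def relations_hold(size=3):
    """Check (54), (55), (56) on all sites of a small box."""
    for x in product(range(-size, size), repeat=3):
        for i in DIRECTIONS:
            if eta(-i, shift(x, i)) * eta(i, x) != -1:      # eq.(55)
                return False
            if eta(i, shift(x, i)) * eta(i, x) != 1:        # eq.(56)
                return False
            for j in DIRECTIONS:
                if abs(i) == abs(j):
                    continue
                lhs = eta(i, shift(x, j)) * eta(j, x)
                rhs = -eta(j, shift(x, i)) * eta(i, x)
                if lhs != rhs:                               # eq.(54)
                    return False
    return True
```

With this choice eq.(56) already holds, so no further gauge transformation is needed; other solutions of (54) and (55) would first have to be gauged as described above.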

The Kogut Susskind discretization of the massless Dirac equation in continuous
time is the $\delta t \to 0$ limit of the following equation

$$\psi(x, t+\delta t) = \psi(x,t) + \delta t \sum_i \eta^{-i}(x+i)\,\psi(x+i) . \qquad (58)$$

The sum goes over all nearest neighbors $x+i$ of $x$, i.e. over positive and negative
directions. $\hbar$ dropped out and we put $c = 1$. The equation is invariant under
the aforementioned gauge transformations. One may demand $\eta^i(x) = \pm1$. In
this case the gauge freedom restricts to $Z(2)$-gauge transformations $\gamma(x) = \pm1$.
The group $Z(2)$ is the center of $SL(2,\mathbf{C})$. It remains as a gauge group after one
uses the flatness condition to gauge away the $SO(3,1)$-connection.
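
A minimal sketch of the update rule (58) follows. It assumes a small periodic cubic lattice of even extent, the standard staggered phases $\eta^i(x) = (-1)^{x_1+\dots+x_{i-1}}$ with $\eta^{-i}$ fixed by eq.(55), and explicit Euler stepping; these choices and all names are illustrative. The sketch also exhibits the lattice counterpart of "$D$ is a square root of the negative Laplacian": applying the neighbor sum of (58) twice gives the second difference over two lattice spacings.

```python
from itertools import product

L = 4  # sites per direction; must be even so that the phases are periodic
SITES = list(product(range(L), repeat=3))
DIRECTIONS = (1, 2, 3, -1, -2, -3)


def shift(x, i, steps=1):
    """Site reached from x by `steps` steps in signed direction i (periodic)."""
    y = list(x)
    y[abs(i) - 1] = (y[abs(i) - 1] + (steps if i > 0 else -steps)) % L
    return tuple(y)


def eta(i, x):
    """Assumed staggered phase on the link from x to x + i, see eq.(55)."""
    if i > 0:
        return (-1) ** (sum(x[: i - 1]) % 2)
    return -eta(-i, shift(x, i))


def hop(psi):
    """Neighbor sum of eq.(58): sum_i eta^{-i}(x + i) psi(x + i)."""
    return {x: sum(eta(-i, shift(x, i)) * psi[shift(x, i)] for i in DIRECTIONS)
            for x in SITES}


def euler_step(psi, dt):
    """One step psi(t) -> psi(t + dt) according to eq.(58)."""
    k = hop(psi)
    return {x: psi[x] + dt * k[x] for x in SITES}


def doubled_laplacian(psi):
    """Sum over signed directions of psi(x + 2i) - psi(x)."""
    return {x: sum(psi[shift(x, i, 2)] - psi[x] for i in DIRECTIONS)
            for x in SITES}
```

The cross terms in `hop(hop(psi))` cancel pairwise because of relation (54), which is why only the second differences survive.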

So far we considered a cubic lattice. The consideration can be generalized 
to a triangular lattice (dense sphere packing) jl^. It turns out that one needs 
to augment the constraints (jsj) by the additional condition that the square 
of the parallel transporter around a triangle is $(-1)$ in order to deduce the
unfrustratedness condition. As a result the $\eta$'s take values $\pm i$.






Now we wish to embed these formulae into the general framework. In the
case of the simple Dirac equation in flat space and without gauge fields, the
parallel transporter along the link from $x$ to $x+i$ shall be given by the number
$\eta^i(x)$ and $\Omega_x$ is isomorphic to $\mathbf{C}$.

As discussed before, we wish to think of the spinor fields $\psi(x)$ and their
adjoints $\psi^*(x)$ as links to and from $\infty$. We parameterize these links by elements,
also denoted $\psi(x)$, of the linear space $\Omega_x$ and $\psi^*(x)$ in its dual space, respectively.
We use the addition $+$ in these vector spaces in order to fix the composition law
$\circ_\infty$ of arrows at $\infty$ in a manner which will now be described.

We wish to exhibit properties of the initial state which guarantee that the 
universal dynamics reduces to the Dirac equation. The composition law is de- 
termined by the initial conditions. Therefore we are free to select an appropriate 
composition law, and to use other properties of the initial state to construct it. 

The composition law $\circ$ of arrows at finite points $X$ of space is decreed to
be given by the composition of maps between spaces $\Omega$, in accordance with the
representation theorem. This theorem would tell us that there is also a space
$\Omega_\infty$. But this is of no help, because this space could only be constructed once
we know the composition law $\circ_\infty$. In accordance with the earlier discussion,
we construct this composition by imposing relations between paths.

A general path which goes from $X$ to $Y$, meeting $\infty$ a number $N > 0$ of
times will have the form

$$\dots\psi \circ_\infty l_1 \circ_\infty \dots l_{N-1} \circ_\infty \psi^*\dots$$

where $\psi : \infty \mapsto X$ and $\psi^* : Y \mapsto \infty$ are links, and $l_i : \infty \mapsto \infty$ are paths.
The dots represent arrows which come from paths that do not touch $\infty$. We
impose the following relation. If $l_1$ is a triangle of the form

$$\psi^* \circ u_1 \circ \zeta_1 : \infty \mapsto \infty \qquad (59)$$

where $u_1$ is a link between finite points, then

$$\psi \circ_\infty l_1 = \psi + u_1 \circ \zeta_1 , \qquad (60)$$

while $\psi \circ_\infty l_1 = 0$ otherwise. If $l_1 \dots l_m$ are of the form (59) with different $u_i$ and
$\zeta_i$ then the relation may be applied several times to produce a sum,

$$\psi \circ_\infty l_1 \circ_\infty \dots \circ_\infty l_m = \psi + \sum_i u_i \circ \zeta_i .$$

Using this we see that the Kogut Susskind discretization of the Dirac equation 
can be written in the form of figure ^. 

If there is a gauge field, the $\Omega_x$ become representation spaces for the gauge
group, and the $\eta^i$ get multiplied with parallel transporters that are furnished by
the gauge field. Only the first of the relations ( |5^ ) survives this, and the square 
of the covariant Dirac operator is no longer equal to the negative Laplacian. 

In curved space, the parallel transporters are associated with the Ashtekar
variables $\omega_i$ and the $\eta^i$ involve the Ashtekar variables $e^i$.



9 Practical improvements

If one wants to use discretized versions of the Maxwell, Yang Mills or Einstein
Ashtekar equations in computer simulations, some improvement is called for.
First, one should include growth processes in the dynamics such that the grid
gets refined when the electric or magnetic field (or their analogue) gets large.
Secondly, one ought to guard against accumulating violations of the constraints
from rounding or discretization errors by including suitable gradient terms which
tend to restore the constraints.

10 Quantum Systems and Quantum Space Time 

Now we wish to proceed to a quantized theory. We use Schrödinger wave
functions. In standard quantum mechanics, they depend either on coordinates,
or on momenta, but not both. Accordingly we assign complex amplitudes

$$\Psi(S) \in \mathbf{C} \qquad (61)$$

to systems $S$ which contain only one kind of links. If we come from a classical
description, we must choose either thin links or fat links. In gravity we settle
for the first possibility; the thin links represent $SL(2,\mathbf{C})$ parallel transporters in
this case.

Before going into gravity, let us consider a simpler example.

Example. Quantum mechanical motion of particles in space
can be described by a universal dynamics. We may picture a system of objects
("space points") linked by bidirectional arrows plus additional objects ("point
particles") linked to space points (their positions) by one unidirectional arrow
each, as in figure ??. The space is considered constant, the particles may move
as indicated in figure ??. In the case of a single particle, the system $S = S[x]$ is
then determined by specifying the position $x$ of the particle and we may identify
our Schrödinger amplitude $\Psi(S[x])$ with a standard 1-particle Schrödinger wave
function $\Psi(x)$.

A Schrödinger equation for the complex amplitude of such a system $S$ reads

    [equation (62) is not recoverable from the source]

Summation is over moves $\mu$ of individual particles as in figure ??. For a single
particle of mass $m$ on a triangular or cubic lattice of space points, this is the
standard discretization of the Schrödinger equation for free motion. To see this,
recall the standard discretization of the Laplace operator on a cubic lattice of
lattice spacing $a = 1$,

$$\Delta\Psi(x) = \sum_\mu [\Psi(x+\mu) - \Psi(x)]$$

where $x+\mu$ is the nearest neighbour of $x$ on the lattice in $\mu$ direction. Units of
time $\hbar/2m$ are set to 1.
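
The lattice Laplacian $\Delta\Psi(x) = \sum_\mu[\Psi(x+\mu) - \Psi(x)]$ of the example above can be made concrete as follows. This is a sketch under stated assumptions: the sum is taken over all six nearest neighbours of a site (positive and negative directions), the lattice is periodic with `L = 6` sites per direction, and the function names are illustrative. Plane waves are then eigenfunctions, with eigenvalue $\sum_\mu (2\cos k_\mu - 2)$, which tends to $-|k|^2$ for small $k$ as expected for a discretization of free motion.

```python
import cmath
import math
from itertools import product

L = 6  # assumed number of sites per direction (periodic boundary conditions)
SITES = list(product(range(L), repeat=3))


def neighbours(x):
    """The six nearest neighbours x + mu of x on the periodic cubic lattice."""
    result = []
    for axis in range(3):
        for step in (1, -1):
            y = list(x)
            y[axis] = (y[axis] + step) % L
            result.append(tuple(y))
    return result


def laplacian(psi):
    """Discrete Laplacian: sum over neighbours of psi(x + mu) - psi(x)."""
    return {x: sum(psi[y] - psi[x] for y in neighbours(x)) for x in SITES}


def plane_wave(n):
    """exp(i k.x) with wave vector k = 2 pi n / L for an integer triple n."""
    return {x: cmath.exp(2j * math.pi * sum(a * b for a, b in zip(n, x)) / L)
            for x in SITES}


def eigenvalue(n):
    """Eigenvalue of the discrete Laplacian on plane_wave(n)."""
    return sum(2 * math.cos(2 * math.pi * a / L) - 2 for a in n)
```

A constant wave function (`n = (0, 0, 0)`) has eigenvalue zero, so it does not evolve under the free dynamics.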
At least in the case of a compact space, the dynamics of quantum gravity is
the most universal one that one can conceive of. There is no time dependence
because the Hamiltonian is zero. The interpretation of this fact is being debated.
It is tempting to interpret Parmenides as saying that dynamics comes into the
world only through the separation between observer and observed. I come back
to this, but for now let us accept that the only remaining problem in quantum
gravity is to find the space of wave functions. And for any system it is a nontrivial
problem to find this space because of the constraints.

The wave functions will be required to be gauge invariant. More generally, 
the wave function cannot assign different amplitudes to systems S which cannot 
be distinguished in an intrinsic way by properties which can be formulated in 
the language of thought. 

In the quantum theory, gauge invariant functions of the parallel transporters
will become multiplication operators. Examples are operators $\mathrm{tr}\,l$, where $l =
b_1 \circ \dots \circ b_n$ are loops and where the trace tr is a gauge invariant function on loops.
Operators which involve canonical momenta ("fat links") in the classical case
will have to be represented as substitution operators. They are called "grasp
operators" in quantum gravity. They map $\Psi$ to linear combinations of wave
functions for values of the argument $S$ which are obtained from each other by
some operation on $S$ which is specified by the observable.

The interpretation of this prescription is somewhat subtle though. How
would one specify a special loop $l$ in a general system $S$? In general this is
impossible. But there are observables like $\prod \mathrm{tr}\,l$ (products over all triangular
loops in $S$) which are well defined products of such loops for all systems.
Operators which have a physical meaning, such as the constraints, will have to be
of this type.

This gives us one immediate candidate for a solution to all conceivable constraints
of the kind which demands invariance of the wave function under a
particular possible dynamics. If the operation of the substitution operators on
$S$ is compounded from local actions, they can not change the category of $S$.
[This generalizes the classical result of section 4.1.] Therefore wave functions
$\Psi(S)$ which depend on $S$ only through its category will obey all reasonable
constraints. This is reminiscent of the construction of thermodynamic ensembles from
conserved quantities.

Let us specialize to quantum gravity. How can one label a basis in the space
of wave functions $\Psi$? The problem has two parts. A system $S$ is specified by a
graph $G = \mathrm{Graph}(S)$ and by relations between the paths on $G$. In the traditional
formulation, these relations are given implicitly by prescribing a gauge field
configuration on $G$. The Mandelstam relations are among these [?]. The first
problem then is to label wave functions $\Psi_{G,p}$ whose arguments are gauge field
configurations, i.e. systems $S$ with a given graph $G$. The second part of the
problem concerns the generalization to wave functions which are defined for
arbitrary systems $S$. I will restrict attention to wave functions whose support
consists of systems whose graph is the skeleton of a simplicial complex.

Footnote (on the trace tr): Such a function tr always exists. An example is as follows.
Consider loops $l : X \mapsto X$ and set $\mathrm{tr}\,l = 1$ if $l = \iota_X$ and $0$ otherwise. When the $\Omega_X$
are linear spaces, the trace of linear operators can be used instead. Remember that the
existence of invariants like tr is to be read off from the initial state.

The first problem has recently been solved within the loop space approach
to quantum gravity [?] with the help of Penrose spin networks [?].
The issue had been to take account of the relations between traces of loops
which hold true for every $SL(2,\mathbf{C})$ gauge field.

A Penrose spin network $P$ is a graph with an assignment of a positive integer
$n(b)$, called its color, to every undirected link $b$ of the graph, subject to certain
conditions. If all the nodes are trivalent, the condition is as follows. The sum of
the colors of the three links incident on a node has to be even, and none of them
is larger than the sum of the other two. Since I prefer to work with directed
graphs where every link $b$ comes with a link $b^*$ in the opposite direction, I set

$$n(b) = n(b^*) .$$
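
The condition at a trivalent node is easy to state in code. The sketch below uses hypothetical function names; `strands` additionally returns how many lines would run between each pair of links at the node, which is a standard way to read the two conditions (they hold exactly when these three numbers are non-negative integers) and is added here for illustration only.

```python
def is_admissible(a, b, c):
    """True if positive integer colors a, b, c are allowed at a trivalent node.

    Conditions from the text: the sum of the three colors is even, and none
    of them is larger than the sum of the other two.
    """
    colors = (a, b, c)
    if any(not isinstance(n, int) or n <= 0 for n in colors):
        return False
    total = sum(colors)
    if total % 2 != 0:
        return False
    # n <= (sum of the other two) is the same as 2 * n <= total
    return all(2 * n <= total for n in colors)


def strands(a, b, c):
    """Number of lines routed between the link pairs (a,b), (b,c), (c,a)."""
    if not is_admissible(a, b, c):
        raise ValueError("colors are not admissible at a trivalent node")
    return ((a + b - c) // 2, (b + c - a) // 2, (c + a - b) // 2)
```

For example, colors `(2, 3, 3)` are admissible, while `(1, 1, 1)` fails the parity condition and `(1, 2, 5)` fails the inequality.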

An embedding $p$ of a Penrose spin network $P$ in a graph $G$ is a map of
distinct nodes of $P$ to distinct nodes of $G$ and of links of $P$ between nodes to
paths in $G$ between corresponding nodes. The paths must not intersect except
as prescribed at the endpoints.

To be in agreement with the literature, I change nomenclature. What has
been called a loop up to now will henceforth be called a simple loop. And loops
$\alpha$ are collections of simple loops $\alpha_1 \dots \alpha_k$. One sets

$$\mathrm{tr}\,\alpha = (-)^k \prod_{a=1}^{k} \mathrm{tr}\,\alpha_a .$$

To every embedded Penrose spin network $p$, a formal sum of loops $\alpha$ with
coefficients $c_\alpha = \pm1$ is assigned by a certain prescription

$$p \mapsto \sum_\alpha c_\alpha\,\alpha .$$

Let $l(b)$ be the number of times the path passes through link $b$ of $G$. The embedded
spin network specifies an integer $n(b) \geq 0$ for links of $G$. The prescription
is such that $n(b) = l(b) + l(b^*)$ and there is antisymmetry under operations of
reconnecting the loops by permuting the end points of instances of the same
link $b$. For details, the reader is referred to the literature. The basis of states is
given by

$$\Psi_{G,p}(S) = \sum_\alpha c_\alpha\,\mathrm{tr}\,\alpha \qquad (63)$$

Given the graph $G$, one obtains in this way a linearly independent set of gauge
invariant functions of systems $S$ with given graph $G$.

The action of loop operators on such states has been defined in the literature
for loops with arbitrary number of "fat links" (dreibeins). In the above
discussion of classical gravity, an exponentiated dreibein was used, because we
had to mimic addition by use of the composition $\circ$. In quantum gravity, there
is no need for this, because the linearity of the state space supplies an operator
$+$. The Lie algebra valued dreibein can be used.

We may adopt the construction, but there remains the above mentioned
second problem to be faced. We do not wish to consider the space as preexisting
(e.g. as a manifold) with only a geometry that remains to be put on it.
Instead we want to define wave functions $\Psi : S \mapsto \mathbf{C}$ on arbitrary systems $S$.
There is no way to formulate an intrinsic specification of an embedded Penrose
spin network which makes sense for a completely arbitrary system $S$. But there
are distinguished classes of such embedded spin networks and we may sum or
average over the representatives. Let me write av for this sum or average. If
there is no representative for a given system $S$, the result is zero.

The solution of the diffeomorphism constraint in quantum gravity through
the use of knots [?] and its elaboration in terms of Penrose spin-networks
[?] suggests what to do. The knot class $K$ of an embedded spin network is
an equivalence class of spin networks which share an intrinsic property. It is
called an s-knot. Embedded networks which are obtained from each other by
homotopic deformation of the paths represent the same s-knot.

This leads us to try

$$\Psi_K(S) = N_K\,\mathrm{av}_{p \in K}\,\Psi_{\mathrm{Graph}(S),p}(S) \qquad (64)$$

where $N_K$ is an arbitrary normalization factor. These states are automatically
diffeomorphism invariant.

I wish to suggest a variation on this theme. Let us consider systems with 
a distinguished object O, called its root. The idea is that all the states are 
supposed to be subject to examination by one and the same observer which is 
somewhere and thus marks a point in space. Briefly, call O the observer. Loops 
from O to an object X and back may be interpreted as queries from O to X 
and answer back from X to O. The message can be influenced by the medium 
through which it passes. The construction above aims at labeling states by 
quantum numbers which record observable properties. If the observation is to 
be made by O, also the spin network which is to be embedded should have a 
distinguished node, and this node is to be mapped on O. This leads to "based 
knots". Parmenides and the idea of considering a quantum system together 
with its observer suggest, moreover, to require gauge invariance only under 
gauge transformations which are trivial at O and to admit functions tr which 
are invariant in this restricted sense only and which may depend on the observer. 
This enlarges the space of observables. 

Systems in quantum mechanics which include an observer are not usually
considered in text books. But they play a role in recent experiments on quantum
erasers [?]. These experiments show that the effect of a measurement on a
quantum system can be erased again, under certain conditions.



Appendix A: Proof of the main representation theorem

A slightly more elaborate version of the representation theorem was proven by
the author in [?]. For the convenience of the reader, the statement and proof of
this theorem are reproduced here, and it is shown how the version in the main
text, which is closer to lattice gauge theory, is obtained as a corollary.

I use a slightly more elaborate notation than in the main text. Given the
system $K$ with objects $X, Y$, denote by $Mor(Y,X)$ the set of arrows $f : X \mapsto Y$
and write $g \circ f : X \mapsto Z$ for the composite of $f \in Mor(Y,X)$ with $g \in Mor(Z,Y)$.

Representation theorem 1 (Representation of a system as a communication
network) Every system $K$ permits a faithful representation with the following
properties.

To every object $X$ there exists an input space $A_X$ and an output space $\Omega_X$.
The input space contains a distinguished element $\emptyset$ ("empty input"). Arrows
$f \in Mor(Y,X)$, $g \in Mor(Z,Y)$ and objects $X$ act as maps

$$X : A_X \mapsto \Omega_X \qquad (65)$$
$$\iota_X : \Omega_X \mapsto A_X \qquad (66)$$
$$f : \Omega_X \mapsto A_Y \qquad (67)$$

with the properties

$$X\iota_X = id : \Omega_X \mapsto \Omega_X , \qquad \iota_X X = id : A_X \mapsto A_X , \qquad (68)$$
$$g \circ f = gYf : \Omega_X \mapsto A_Z . \qquad (69)$$

It should be noted that $\iota_X$ does not act as the identity map in general in this
context.

Given this version of the representation theorem, we restrict attention to the
output spaces $\Omega_X$ and to maps $\hat f = Y \circ f : \Omega_X \mapsto \Omega_Y$. Renaming $\hat f$ into $f$ we
obtain the representation theorem of the main text.



Proof of the representation theorem 1 for categories

Given a system $K$, we write $Mor(Y,*)$ for the set of all its arrows to $Y$ etc.
We define

$$In(Y) = Mor(Y,*) , \qquad Out(Y) = Mor(*,Y) .$$

We write $X = \alpha(f)$ if $f \in Mor(Y,X) \subset In(Y)$, and correspondingly $Z = \omega(f)$
if $f \in Mor(Z,Y) \subset Out(Y)$. The output space will be defined as a subspace
of $\Omega_Y^{pre}$. $\Omega_Y^{pre}$ consists of maps

$$\zeta : Out(Y) \mapsto Mor(*,*)$$

with the property $\zeta(f) \in Mor(\omega(f),*)$.
An object $Y$ will act as a map

$$Y : In(Y) \mapsto \Omega_Y$$

according to

$$Yf(g) = g \circ f \qquad (g \in Out(Y)) .$$

The output space is defined as the image of $Y$, and the input space as space of
equivalence classes (if necessary) of elements of $In(Y)$ which $Y$ maps into the
same $\zeta \in \Omega_Y^{pre}$.

$$\Omega_Y = \mathrm{Im}\,Y \subset \Omega_Y^{pre} , \qquad (70)$$
$$A_Y = In(Y)/\mathrm{Ker}\,Y . \qquad (71)$$

$Y$ is invertible as a map from $A_Y$ to $\Omega_Y$. Its inverse is $\iota_Y$. The empty input
$\emptyset \in A_Y$ is defined as the equivalence class of $\iota_Y \in Mor(Y,Y) \subset In(Y)$.

An arrow $f \in Mor(Y,X)$ is defined as a map $\Omega_X \mapsto A_Y$ by use of the map
$\iota_X : \Omega_X \mapsto A_X$, as follows.

$$f = f \circ \iota_X , \qquad (72)$$
$$g \mapsto f \circ g \quad \text{for } g \in Mor(X,*) . \qquad (73)$$

The last formula defines $f$ as a map from $In(X)$ to $In(Y)$. This map passes
to equivalence classes (71) thereby defining a map $A_X \mapsto A_Y$. The composition
rule (69) holds.



Appendix B: General Relativity in Ashtekar variables

For the convenience of the reader I will briefly review the canonical formalism 
for general relativity in Ashtekar variables. 

Before dealing with Ashtekar variables, let me briefly examine classical gen- 
eral relativity in order to see what a priori structural assumptions are made. 

In classical general relativity one deals with a 4-dimensional space time manifold
$M$ and with a dynamically determined geometry on $M$. The geometry provides
a connection in the tangent bundle which is compatible with a Lorentzian
metric. Field equations for the metric and the connection are derived from a
variational principle. The vanishing of the torsion is one of these field equations.

In the vierbein formalism, the connection in the tangent bundle can be
thought to be constructed in two steps.

1. There is a connection in a vector bundle over $M$ with fibers $V_x \cong V^{(\frac12,\frac12)} \approx \mathbf{R}^4$.
   This connection preserves a bilinear form $\langle\,,\rangle_x$ of signature $(+---)$
   on the fibers.

2. The fibers $V_x$ are identified with the tangent spaces $T_xM$.

$V^{(\frac12,\frac12)}$ is the 4-dimensional real representation space of the Lorentz group
("vector representation").

The connection specifies parallel transporters. They are linear maps which
preserve the bilinear form

$$\mathcal{U}(C) : V_x \mapsto V_y , \qquad (74)$$
$$\langle\mathcal{U}(C)v, \mathcal{U}(C)w\rangle_y = \langle v, w\rangle_x \qquad (75)$$

They transport vectors $v, w \in V_x$ along piecewise smooth paths $C$ on $M$ from $x$ to
$y$.

The identification is provided by a vierbein. It specifies an invertible map
from the tangent space to the internal space $^{10}$

$$\mathcal{E}(x) : T_xM \mapsto V_x \qquad (76)$$

for every $x$. By virtue of the identification, the bilinear form $\langle\,,\rangle_x$ on the
fibers becomes a Lorentz metric $g$ on $M$, viz. $g(X,Y) = \langle\mathcal{E}(x)X, \mathcal{E}(x)Y\rangle_x$ for
$X, Y \in T_xM$.

In this manner, general relativity appears as a gauge theory with gauge 
group isomorphic to the Lorentz group $SO(3,1)$ and with a distinguished field,
the vierbein field. The action has a particular form. 

The standard description is obtained by introducing coordinate systems on
charts of $M$ and a moving frame on each chart. The moving frame specifies a
pseudo-orthonormal basis $f(x) = (f_0(x), f_1(x), f_2(x), f_3(x))$ of $V_x$ for every $x$ in
the chart. Pseudo-orthogonality reads

$$\langle f_\alpha(x), f_\beta(x)\rangle_x = \eta_{\alpha\beta}$$

with $\eta_{\alpha\beta} = \mathrm{diag}(+1,-1,-1,-1)$. The moving frame serves to convert linear
maps into matrices.

The pseudo-orthonormal frames $f(x)$ form the fibers of a principal fiber
bundle whose structure group is the Lorentz group $SO(3,1)$. Parallel transport
of vectors induces parallel transport of frames and thereby a connection on a
principal fiber bundle.

The coordinate system specifies a basis in the tangent spaces $T_xM$. Expanding
everything in sight, one gets the components $E^\alpha_{\ \mu}$ of the vierbein and $E_\alpha^{\ \mu}$ of
its inverse, the components $g_{\mu\nu}$ of the metric tensor, and the parallel transport
matrices $U(C) \in SO(3,1)$ with entries $U^\beta_{\ \alpha}$.

$$\mathcal{E}(x)\partial_\mu = E^\alpha_{\ \mu}(x)\,f_\alpha(x) , \qquad (77)$$
$$\mathcal{E}(x)^{-1}f_\alpha(x) = E_\alpha^{\ \mu}(x)\,\partial_\mu , \qquad (78)$$
$$E^\alpha_{\ \mu}\,\eta_{\alpha\beta}\,E^\beta_{\ \nu} = g_{\mu\nu}(x) \qquad (79)$$
$$\mathcal{U}(C)f_\alpha(x) = f_\beta(y)\,U^\beta_{\ \alpha}(C) . \qquad (80)$$

$^{10}$ Beware of confusion. This is not the same $\mathcal{E}$ as in the main text.

The moving frame serves to convert linear maps into matrices. The parallel
transport matrix $U(C)$ for infinitesimal paths $C$ from a point $x$ with coordinates
$x^\mu$ to a neighboring point with coordinates $x^\mu + \delta x^\mu$ defines the vector potential

$$U(C) = 1 - \Gamma_\mu(x)\,\delta x^\mu . \qquad (81)$$

The entries of the vector potential are also known as the connection coefficients
in the anholonomic basis provided by $f_\alpha$.

The matrices $R_{\mu\nu}$ whose entries are the anholonomic components $R^\beta_{\ \alpha\mu\nu}$ of the
field strength- or curvature-tensor $\mathcal{F}_{\mu\nu}(x) : V_x \mapsto V_x$ are given by the standard
formula

$$\mathcal{F}_{\mu\nu}(x)f_\alpha(x) = f_\beta(x)\,R^\beta_{\ \alpha\mu\nu}(x) , \qquad (82)$$
$$R_{\mu\nu} = \partial_\mu\Gamma_\nu - \partial_\nu\Gamma_\mu + \Gamma_\mu\Gamma_\nu - \Gamma_\nu\Gamma_\mu \qquad (83)$$

One may compute the parallel transporter $\mathcal{U}(C)$ around an infinitesimal triangle
$C = \Delta$ whose corners have coordinates $\{x^\mu\}$, $\{x^\mu + \delta y^\mu\}$, $\{x^\mu + \delta z^\mu\}$. The result
can be stated in basis independent form as

$$\mathcal{U}(\Delta) = 1 - \frac12\mathcal{F}_{\mu\nu}(x)\,\delta y^\mu\delta z^\nu + \dots . \qquad (84)$$

The differentials $\delta y^\mu$, $\delta z^\nu$ should be regarded as anti-commuting.

Gauge transformations are determined by matrices $S(x) = (S^\alpha_{\ \beta}(x)) \in SO(3,1)$.
A (passive) gauge transformation is a change of moving frame

$$f_\alpha(x) \mapsto f_\beta(x)\,S^\beta_{\ \alpha}(x) . \qquad (85)$$

This transformation preserves pseudo-orthonormality. The parallel transport 
matrix, vierbein components and vector potential transform in the familiar way 
under such gauge transformations. 

Connections in spinor space. Ashtekar variables 

The Ashtekar variables appear very naturally if one starts from parallel transport
of spinors rather than 4-vectors. Such parallel transport of spinors must
be considered anyway when one wants to describe matter by wave functions
for spin $\frac12$ particles. The gauge group is then the quantum mechanical Lorentz
group $SL(2,\mathbf{C})$.

Because of the structural assumptions of the standard theory, the parallel
transport of vectors in an arbitrary representation space of the structure group
determines the parallel transport of vectors in any representation space.

The fibers $V_x^+ \cong V^{(\frac12,0)} \approx \mathbf{C}^2$ and $V_x^- \cong V^{(0,\frac12)} \approx \mathbf{C}^2$ are now isomorphic
to 2-dimensional complex representation spaces of $SL(2,\mathbf{C})$. One has

$$V^{(\frac12,\frac12)} = V^{(\frac12,0)} \otimes V^{(0,\frac12)} . \qquad (86)$$

More precisely, $V^{(\frac12,\frac12)}$ is a real subspace of the complex representation space
$V^{(\frac12,0)} \otimes V^{(0,\frac12)}$. This identification can serve to construct a moving frame in
$V^{(\frac12,\frac12)}$ from a moving frame in $V^{(\frac12,0)}$. The parallel transport matrices
$\mathbf{u}(C) \in SL(2,\mathbf{C})$ for vectors in $V_x^+ \cong V^{(\frac12,0)}$ and $U(C) \in SO(3,1)$ for vectors
in $V_x \cong V^{(\frac12,\frac12)}$ are related by the fundamental formula of spinor calculus,

$$A\sigma_\alpha A^* = \sigma_\beta\,\Lambda^\beta_{\ \alpha}(A) \quad \text{for } A \in SL(2,\mathbf{C}) , \qquad (87)$$
$$X\sigma_\alpha + \sigma_\alpha X^* = \sigma_\beta\,\Lambda^\beta_{\ \alpha}(X) \quad \text{for } X \in sl(2,\mathbf{C}) . \qquad (88)$$

$\sigma_i$ are the Pauli matrices for $i = 1,2,3$ and $\sigma_0 = 1$ ($2\times2$ identity matrix).
This formula yields the Lorentz transformation $\Lambda(A)$ which is associated with
$A \in SL(2,\mathbf{C})$, and similarly for elements of the Lie algebra $sl(2,\mathbf{C})$.
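
The formula $A\sigma_\alpha A^* = \sigma_\beta\Lambda^\beta_{\ \alpha}(A)$ can be turned into a small computation. The sketch below is illustrative only: it extracts the entries as $\Lambda^\beta_{\ \alpha} = \frac12\mathrm{tr}(\sigma_\beta A\sigma_\alpha A^*)$, which uses $\mathrm{tr}(\sigma_\beta\sigma_\gamma) = 2\delta_{\beta\gamma}$, and then checks that the resulting $4\times4$ matrix preserves the metric $\mathrm{diag}(+1,-1,-1,-1)$. The function names and the sample matrices are assumptions made for the example.

```python
# Lorentz matrix associated with A in SL(2,C) through A sigma_a A* = sigma_b Lambda^b_a.
# Function names and the sample matrices in the usage note are illustrative.

SIGMA = [
    [[1, 0], [0, 1]],        # sigma_0 (identity)
    [[0, 1], [1, 0]],        # sigma_1
    [[0, -1j], [1j, 0]],     # sigma_2
    [[1, 0], [0, -1]],       # sigma_3
]
METRIC = (1, -1, -1, -1)


def matmul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def dagger(a):
    """Hermitian conjugate of a 2x2 matrix."""
    return [[complex(a[j][i]).conjugate() for j in range(2)] for i in range(2)]


def lorentz_matrix(A):
    """Entries Lambda[beta][alpha] = tr(sigma_beta A sigma_alpha A*) / 2."""
    A_dag = dagger(A)
    lam = [[0.0] * 4 for _ in range(4)]
    for alpha in range(4):
        image = matmul(matmul(A, SIGMA[alpha]), A_dag)   # hermitian 2x2 matrix
        for beta in range(4):
            prod = matmul(SIGMA[beta], image)
            lam[beta][alpha] = ((prod[0][0] + prod[1][1]) / 2).real
    return lam


def preserves_metric(lam, tol=1e-9):
    """Check sum_beta Lambda^beta_alpha eta_beta Lambda^beta_delta = eta_{alpha delta}."""
    for alpha in range(4):
        for delta in range(4):
            value = sum(lam[beta][alpha] * METRIC[beta] * lam[beta][delta]
                        for beta in range(4))
            expected = METRIC[alpha] if alpha == delta else 0
            if abs(value - expected) > tol:
                return False
    return True
```

For the diagonal matrix `[[2, 0], [0, 0.5]]`, which has determinant 1, the result is a boost in the 3-direction; the metric is preserved because $\det(x^\alpha\sigma_\alpha) = (x^0)^2 - |\vec x|^2$ is unchanged under $x^\alpha\sigma_\alpha \mapsto A\,x^\alpha\sigma_\alpha A^*$ when $\det A = 1$.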

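We will use boldface letters to characterize complex $2\times2$ matrices throughout.

The parallel transport matrices $\mathbf{u}(C)$, the vector potential $\boldsymbol\omega_\mu \in sl(2,\mathbf{C})$
and the field strength matrix $\mathbf{F}_{\mu\nu} \in sl(2,\mathbf{C})$ in spinor space obey the following
relations

$$\mathbf{u}(C)\,\sigma_\alpha\,\mathbf{u}(C)^* = \sigma_\beta\,U(C)^\beta_{\ \alpha} , \qquad (89)$$
$$\mathbf{F}_{\mu\nu}(x)\,\sigma_\alpha + \sigma_\alpha\,\mathbf{F}_{\mu\nu}(x)^* = \sigma_\beta\,R^\beta_{\ \alpha\mu\nu}(x) \qquad (90)$$
$$\mathbf{u}(C) = 1 - \boldsymbol\omega_\mu(x)\,\delta x^\mu + \dots \qquad (91)$$
$$\mathbf{F}_{\mu\nu} = \partial_\mu\boldsymbol\omega_\nu - \partial_\nu\boldsymbol\omega_\mu + \boldsymbol\omega_\mu\boldsymbol\omega_\nu - \boldsymbol\omega_\nu\boldsymbol\omega_\mu \qquad (92)$$

One converts also the vierbein $\mathcal{E}$ to a spinorial basis. For a fixed $\mu$, the vector
$\mathcal{E}(x)\partial_\mu \in V_x$. Because of the identification (86) of representation spaces, we
may also regard $\mathcal{E}(x)\partial_\mu$ as an element of $V_x^+ \otimes V_x^-$.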

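To obtain convenient formulas, one introduces $\tilde\sigma_i = -\sigma_i$ ($i = 1,2,3$) and
$\tilde\sigma_0 = \sigma_0$ so that

$$\mathrm{tr}(\tilde\sigma_\alpha\sigma_\beta) = 2\eta_{\alpha\beta} , \qquad (93)$$
$$\tilde\sigma_\alpha = \epsilon\,{}^t\sigma_\alpha\,\epsilon^{-1} \qquad (94)$$

where $\epsilon$ is the completely antisymmetric tensor in two dimensions, and ${}^t\sigma$ is
the transpose of the matrix $\sigma$.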

We introduce two (hermitian) 2 x 2-niatrices E*^ and 

as spinorial versions of the inverse vierbein (EJ^). 

Ef = tr (<T„E^) = tr (5-„E^) , (95) 
afsE'^f" = E'^. (96) 
The Einstein action will involve the volume form $E(x)\,d^4x$ with

$$E = \det(E^\alpha_{\ \mu}) = \det(E_\alpha^{\ \mu})^{-1} = \sqrt{-g} .$$



Using the relation (90) of the field strength and the definitions of the spinorial
versions of the vierbein one finds



Einstein action = j d'^xL; (97) 
L - EE^^^Ef^'^R^p^, (98) 
= £;tr|F^^E''E^ + E*''F*^E*'^|. (99) 



By a choice of gauge, i.e. of a suitable moving frame, the inverse vierbein is 
brought to the form 

e: 



(100) 



It follows that

$$E = Ne , \qquad e = \det(e_a^{\ i})^{-1} . \qquad (101)$$

$N$ and $N^i$ are known as lapse and shift functions, and $(e_a^{\ i})$ is the inverse
dreibein, with holonomic indices $i = 1,2,3$ and anholonomic indices $a = 1,2,3$.
Its spinorial version $\mathbf{e}^i$ is given by

e; - \tT{aa^), (102) 

E^ = e^+E^oVo. (103) 

Finally one introduces 

N_ = iVe"^ (104) 

= ee\ (105) 

Consider now a space-like surface $x^0 = t$. The parallel transport along curves
$C$ within the surface is determined by the space components $\boldsymbol\omega_i$ of the vector
potential.

The Ashtekar variables are the $2\times2$ matrices $\boldsymbol\omega_i$ and $\mathbf{e}^i$.



Canonical formalism 

Dirac's canonical formalism with constraints is used to bring the equation of
motion to Hamiltonian form and to determine constraints on the initial data.
There is a subtle point, however. Ashtekar applies the canonical formalism
to a theory with a gauge group $SL(2,\mathbf{C}) \times SL(2,\mathbf{C})$ ("complex relativity").
This means that left handed and right handed spinors have independent parallel
transporters, with independent vector potentials $\boldsymbol\omega$ and $\boldsymbol\omega^*$, and there are also
two independent vierbein variables $\mathbf{E}$ and $\mathbf{E}^*$. This is quite natural from the
point of view of the philosophy of this paper. To compare the left and right
handed spinors, one would need to assume that there exists an invariant operation,
complex conjugation, which commutes with parallel transport. It is
natural to consider the existence of such an invariant and an associated relation
between parallel transporters as a distinguishing feature of initial states in
general relativity, but not as a priori structure.

Dirac's formalism is described in detail in the text book [?]. In the course
of the analysis, first class and second class constraints appear. The second class
constraints must be imposed as strong constraints, i.e. they are equalities on
all the phase space.

As a warm up exercise the reader may consider the following nonstandard
form of the Maxwell action

$$L_{Maxwell} = -\int d^4x \left\{ \frac12(\partial_\mu A_\nu - \partial_\nu A_\mu)F^{\mu\nu} - \frac14 F_{\mu\nu}F^{\mu\nu} \right\} \qquad (106)$$

The condition $F_{ij} = \partial_iA_j - \partial_jA_i$ arises in the form of a second class constraint.
The standard Hamiltonian is obtained and the Gauss law is a first class constraint.

Let us return to the Einstein action. $\boldsymbol\omega_\mu$ are complex traceless $2\times2$ matrices.
Let me repeat: Ashtekar deviates from the rules in the book in an important
way. He starts with the assumption that the $sl(2,\mathbf{C})$ vector potentials $\boldsymbol\omega$ and
$\boldsymbol\omega^*$ are independent variables to begin with, and also the dreibein variables
which come to multiply $\mathbf{F}_{\mu\nu}$ and $\mathbf{F}^*_{\mu\nu}$ in the formula for the Einstein action are
independent variables. In other words, he starts with complex general relativity;
the reality constraints are only imposed to select among the solutions.

Proceeding in this way, attention is restricted to the first term in the Einstein
action (99) which involves $\boldsymbol\omega_\mu$. The first step is the determination of the
conjugate variables $\boldsymbol\pi^\mu$. One finds

*7r^ = — =e( E^E° - E^E^") (107) 

ddou;^ V / 

${}^t\sigma$ stands for the transpose of a $2\times2$ matrix $\sigma$ again.

One deduces from this that $\boldsymbol\pi^0 = 0$. In addition one finds Ashtekar's celebrated
result that vector potential and the "densitized" inverse dreibein are
conjugate variables

tt' 2e\ (108) 

Proceeding further according to the rules in the book, the canonical Hamiltonian
$H_c$ is obtained as a sum of three terms, all of which are first class constraints.
This means that they vanish on the constraint surface.

He = -NH - 2N'H, - 2G, (109) 

H tr (Fy-e'e^) , (110) 

H, =. tr{F,,e'), (111) 

G = trluioDie') . (112) 
Summations are from 1 to 3. The hermitian conjugate h.c. involves the adjoint
matrices $\mathbf{F}^*_{ij}$ and $\boldsymbol\omega_0^*$; the dreibein matrices $\mathbf{e}^i$ are self adjoint.

This is the second one of Ashtekar's celebrated results. It exhibits the constraints
in polynomial form. $N$, $N^i$ and the matrix $\boldsymbol\omega_0$ are Lagrange multipliers;
they have zero conjugate variables. The covariant derivative has the
conventional form

$$D_i\sigma = \partial_i\sigma + [\boldsymbol\omega_i, \sigma] . \qquad (113)$$

Time development is from space-like hyper-surface to space-like hyper-surface.
There is freedom in choosing the foliation into hyper-surfaces and of Lorentz
gauge transformations. This is reflected in the freedom of choice of the Lagrange
multipliers. We choose

$$N^i = 0 , \qquad \boldsymbol\omega_0 = 0 . \qquad (114)$$

The field equations are obtained with the help of the following lemma. Suppose
$A^{ij}$ is independent of $\boldsymbol\omega_k$. Then

—tr (A'^Fij) = * (DjA''^ - DjA'^) . (115) 



One obtains 



2wfe = ^=iV[Ffci,e^ (116) 



2e' = -^=NDi[e\y\ (117) 
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